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NOVEL PROTEINS AND NUCLEIC ACIDS ENCODING SAME 

BACKGROUND OF THE INVENTION 

The invention generally relates to nucleic acids and polypeptides. More particularly, the 
invention relates to nucleic acids encoding novel molecules (MOL) polypeptides, as well as 
5 vectors* host cells, antibodies, and recombinant methods for producing these nucleic acids and 
polypeptides. 

SUMMARY OF THE INVENTION 

The invention is based in part upon the discovery of nucleic acid sequences encoding 
novel polypeptides. The novel nucleic acids and polypeptides are referred to herein as MOLX, 
10 or MOL1, MOL2, MOL3, M0L4, MOL5, MOL6, MOL7, MOL8, MOL9, and MOL10 nucleic 
acids and polypeptides. These nucleic acids and polypeptide?;, as well as derivatives, homologs, 
analogs and fragments thereof, will hereinafter be collectively designated as "MOLX"nuoleic 
acid or polypeptide sequences. 

In one aspect, the invention provides an isolated MOLX nucleic acid molecule encoding 
1 5 a MOLX polypeptide that includes a nucleic acid sequence that has identity to the nucleic acids 
disclosed in SEQ ID NOS:i, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21. 23, 25, 27, and 29. In some 
embodiments, (he MOLX nucleic acid molecule will hybridize under stringent conditions to a 
nucleic acid sequence complementary to a nucleic acid molecule that includes a protein-coding 
sequence of a MOLX nucleic acid sequence. The invention also includes an isolated nucleic acid 
20 that encodes a MOLX polypeptide, or a fragment, homolog, analog or derivative thereof For 
example, the nucleic acid can encode a polypeptide at least 80% identical to a polypeptide 
comprising the amino acid sequences of SEQ ID NOS:2, 4, 6, 8 ; 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28, and 30. The nucleic acid can be, for example, a genomic DNA fragment or a cDNA 
molecule that includes the nucleic acid sequence of any of SEQ ID NOS: 1, 3, 5, 7, 9, 1 1 , 13, 15, 
25 17, 19, 21, 23, 25, 27, and 29. 

Also included in the invention is an oligonucleotide, e.g. 9 an oligonucleotide which 
includes at least 6 contiguous nucleotides of a MOLX nucleic acid (e.g., SEQ ID NOS:l , 3,5,7, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29) or a complement of said oligonucleotide. 

Also included in the invention are substantially purified MOLX polypeptides (SEQ ID 
30 NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30). In certain embodiments, the 

1 



wo tnmxsiH 



PCT/l S01/13S78 



MOLX polypeptides include an Amino add sequence that is substantially identical to the amino 
acid sequence of a human MOLX polypeptide. 

The invention also features antibodies that immunoselectively bind to MOLX 
polypeptides, or fragments, homologs, analogs or derivatives thereof. 
5 In another aspect, the invention includes pharmaceutical compositions that include 

therapeutically- or prophylactically-effective amounts of a therapeutic and a pharmaceutically- 
acceptable carrier. Hie therapeutic can be, &g., a MOLX nucleic acid, a MOLX polypeptide, or 
an antibody specific for a MOLX polypeptide. In a further aspect, the invention includes, in one 
or more containers, a therapeutically- orprophylactically-eflfective amount of this 
10 pharmaceutical composition. 

hi a further aspect, the invention includes; a method of producing a polypeptide by 
cultuiing a cell that includes a MOLX nucleic acid, under conditions allowing for expression of 
the MOLX polypeptide encoded by the DNA. If desired, the MOLX polypeptide can then be 
recovered 

15 In another aspect, the invention includes a method of detecting the presence of a MOLX 

polypeptide in a sample. In the method, a sample is contacted with a compound that selectively 
binds to the polypeptide under conditions allowing for formation of a complex between the 
polypeptide and the compound. The complex is detected, if present, thereby identifying the 
MOLX polypeptide within the sample. 

20 The invention also includes methods to identify specific cell or tissue types based on their 

expression of a MOLX. 

Also included in the invention is a method of detecting the presence of a MOLX nucleic 
acid molecule in a sample by contacting the sample with a MOLX nucleic acid probe or primer, 
and detecting whether the nucleic acid probe or primer bound to a MOLX nucleic acid molecule 

25 in the sample. 

In a further aspect, the invention provides a method for modulating the activity of a 
MOLX polypeptide by contacting a cell sample that includes the MOLX polypeptide with a 
compound that binds to the MOLX polypeptide in an amount sufficient to modulate the activity 
of said polypeptide. The compound can be, e.g., a small molecule, such as a nucleic acid, 
30 peptide, polypeptide, peptidomtmctic, carbohydrate, lipid or other organic (carbon containing) or 
inorganic molecule, as further described herein. 

Also within the scope of the invention is the use of a therapeutic in the manufacture of a 
medicament for treating or preventing disorders or syndromes including, e.g. y diabetes, 
metabolic disturbances associated with obesity, the metabolic syndrome X, anorexia, wasting 
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disorders associated with chronic diseases, metabolic disorders, diabetes, obesity, infectious 
disease, anorexia, cancer^associatcd cachexia, cancer, neurodegenerative disorders, Alzheimer's 
Disease, Parkinson's Disorder, immune disorders, and hematopoietic disorders, ox other 
disorders related to cell signal processing and metabolic pathway modulation. The therapeutic 
5 can be, e.g., a MOLX nucleic acid, a MOLX polypeptide, or a MOLX-specific antibody, or 
biologically-active derivatives or fragments thereof. 

For example, the compositions of the present invention will have efficacy for treatment 
of patients suffering from: Cancer including pancreatic cancer, adenoma, brain tumor, colon 
cancer breast cancer, prostate cancer, testis cancer, neurological disorders including age-related 

10 disorders, Alzheimer's disease, Stroke, Parkinson's disease, Huntington's disease, Cerebral palsy, 
Epilepsy, Behavioral disorders, Addiction, Anxiety, Pain, nephropathy, neurodegenerative 
disorders, Aneurysms, Fibromuscular dysplasia, metabolic disorders including, failure to thrive, 
nutritional edema, hypoproteinemia, trypsinogen deficiency disease, chronic and heriditary 
pancreatitis, enterkinase defieciency, Hypercholesterolemia, Obesity, Diabetes, cardiac disorders 

15 mchising tachycardia, erythroderma, long QT syndrome, heart block, Ataxia-telangiectasia, 

Cardiomyopathy, Atherosclerosis, Hypertension, Congenital heart defects, Aortic stenosis, Atrial 
septal defect (ASD), Atrioventricular (A-V) canal defect, Ductus arteriosus, Pulmonary stenosis, 
Subaortic stenosis, Ventricular septal defect (VSD), valve diseases, Larsen syndrome, night 
blindness, brugada syndrome, Von Hippel-Lindau (VHL) syndrome, Tuberous sclerosis^ 

20 Hypercalcemia, Cirrhosis, angiogenesis and wound healing, blood pressure regulation, Trauma, 
Tuberous sclerosis, Fertility, Hirschsprung's disease, Renal artery stenosis, Interstitial nephritis, 
Glomerulonephritis, Polycystic kidney disease, Renal tubular acidosis, IgA nephropathy, 
immune disorders including cell-mediated immunity disorders, Leukodystrophies, inflammation, 
Hyperthyroidism, Hypothyroidism, Lesch-Nyhau syndrome, Multiple sclerosis, Transplantation, 

25 lung diseases, including asthma, Emphysema, imrnundeficiencies, Crohn's disease, Scleroderma, 
Appendicitis, Autoimmune diseases, Systemic lupus erythematosus, developmental disorders, 
neural tube defects, modulation of apoplosis, viral, bacterial, and parasitic infections and/or other 
pathologies and disorders of the like. 

The polypeptides can be used as imnnmogens to produce antibodies specific foe the 

30 invention, and as vaccines. They can also be used to screen for potential agonist and antagonist 
compounds. For example, a cDNA encoding MOLX may be useful in gene therapy, and MOLX 
may be useful when administered to a subject in need thereof. By way of nonlimiting example, 
the compositions of the present invention will have efficacy for treatment of patients suffering 
from Cancer including pancreatic cancer, adenoma, brain tumor, colon cancer breast cancer, 
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prostate cancer, testis career, neurological disorders including age-related disorders, Alzheimer's 
disease, Stroke, Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, Behavioral 
disorders, Addiction, Anxiety, Pain, nephropathy, neurodegenerative disorders, Aneurysms, 
Fibromuscular dysplasia, metabolic disorders including, failure to thrive, nutritional edema, 
5 hypoproteincmia, trypsinogen deficiency disease, chronic and heriditary pancreatitis, enterkmase 
defieciency, Hypercholesterolemia, Obesity, Diabetes, cardiac disorders inclusing tachycardia, 
erythroderma, long QT syndrome, heart block, Ataxia-telangiectasia, Cardiomyopathy, 
Adierosolerusis, Hypertension, Congenital heart defects, Aortic stenosis, Atrial septal defect 
(ASD), Atrioventricular (A-V) canal defect, Ductus arteriosus, Pulmonary stenosis, Subaortic 

10 stenosis, Ventricular septal defect (VSD), valve diseases, Larsen syndrome, night blindness, 
brugada syndrome, Von Hippel-Lindau (VHL) syndrome, Tuberous sclerosis, Hypercalceunia, 
Cirrhosis, angiogenesis and wound healing, blood pressure regulation, Trauma, Tuberous 
sclerosis, Fertility, Hirschsprung's disease, Renal ariery stenosis, Interstitial nephritis, 
Glomerulonephritis, Polycystic kidney disease, Renal tubular acidosis, IgA nephropathy, 

15 immune disorders including cell-mediated immunity disorders, Leukodystrophies, inflammation, 
Hyperthyroidism, Hypothyroidism, Lcsch-Nyhan syndrome. Multiple sclerosis, Transplantation, 
lung diseases, including asthma, Emphysema, immundeficiencies, Crohn's disease, Scleroderma, 
Appendicitis, Autoimmune diseases, Systemic lupus erythematosus, developmental disorders, 
neural tube defects, modulation of apoptosis, viral, bacterial, and parasitic infections and/or other 

20 pathologies and disorders. 

The invention further includes a method for screening for a modulator of disorders or 
syndromes including, e.g. 9 Cancer including pancreatic cancer, adenoma, brain tumor, colon 
cancer breast cancer, prostate cancer, testis cancer, neurological disorders including age-related 
disorders, Alzheimer's disease, Stroke, Parkinson's disease, Huntington's disease, Cerebral palsy, 

25 Epilepsy, Behavioral disorders, Addiction, Anxiety, Pain, nephropathy, neurodegenerative 

disorders, Aneurysms, Fibromuscular dysplasia, metabolic disorders including, failure to thrive, 
nutritional edema, hypoproleinernia, trypsinogen deficiency disease, chronic and heriditary 
pancreatitis, enterkinase defieciency, Hypercholesterolemia, Obesity, Diabetes, cardiac disorders 
inclusing tachycardia, erythroderma, long QT syndrome, heart block, Ataxia-telangiectasia, 

30 Cardiomyopathy, Atherosclerosis, Hypertension, Congenital heart defects, Aortic stenosis, Atrial 
septal defect (ASD), Atrioventricular (A-V) canal defect, Ductus arteriosus, Pulmonary stenosis, 
Subaortic stenosis, Ventricular septal defect (VSD), valve diseases, Larsen syndrome, night 
blindness, brugada syndrome, Von Hippel-Lindau (VHL) syndrome, Tuberous sclerosis, 
Hypercalcemia, Cirrhosis, angiogenesis and wound healing s blood pressure regulation, Trauma, 
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Tuberous sclerosis, Fertility, Hirschsprnngs disease. Renal arteiy stenosis, Interstitial nephritis, 
Glomerulonephritis, Polycystic kidney disease, Renal tubular acidosis, IgA. nephropathy, 
immune disorders including cell-mediated immunity disorders, Leukodystrophies, inflammation, 
Hyperthyroidism, Hypothyroidism, Lesch-Nyhan syndrome, Multiple sclerosis, Transplantation, 
5 lung diseases, including aathma, Emphysema, immundeficiencies, Crohn's disease, Scleroderma, 
Appendicitis, Autoimmune diseases, Systemic lupus erythematosus, developmental disorders* 
neural tube defects, modulation of apoptosis, viral, bacterial, and parasitic infections or other 
disorders related to cell signal processing and metabolic pathway modulation. The method 
includes contacting a test compound with a MOT,X polypeptide and dctenrmirng if the test 

1 0 compound binds to said MOLX polypeptide. Binding of the test compound to the MOLX 

polypeptide indicates the test compound is a modulator of activity, or of latency or predisposition 
to the aforementioned disorders or syndromes. 

Also within the scope of the invention is a method for screening for a modulator of 
activity, or of latency or predisposition to an disorders or syndromes including, e.g. 9 Cancer 

1 5 including pancreatic cancer, adenoma, brain tumor, colon cancer breast cancer, prostate cancer, 
testis cancer, neurological disorders including age-related disorders, Alzheimer's disease, Stroke, 
Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, Behavioral disorders, 
Addiction, Anxiety, Pain, nephropathy, neurodegenerative disorders, Aneurysms, Fibromuscular 
dysplasia, metabolic disorders including, failure to thrive, nutritional edema, hypeproteinemia, 

20 trypsinogen deficiency disease, chronic and heriditary pancreatitis, enterkinase defieciency, 

Hypercholesterolemia, Obesity, Diabetes, cardiac disorders inclusing tachycardia, erythroderma, 
long QT syndrome, heart block, Ataxia-telangiectasia, Cardiomyopathy, Atherosclerosis, 
Hypertension, Congenital heart defects, Aortic stenosis, Atrial septal defect (ASD) 5 
Atrioventricular (A-V) canal defect, Ductus arteriosus, Pulmonary stenosis, Subaortic stenosis* 

25 Ventricular septal defect (VSD), valve diseases, Larsen syndrome, night blindness, brugada 

syndrome, Von Hippcl-Lindau (VHL) syndrome, Tuberous sclerosis, Hypercalcemia, Cirrhosis, 
angiogenesis and wound healing, blood pressure regulation. Trauma, Tuberous sclerosis, 
Fertility, Hirschsprung's disease, Renal artery stenosis, Interstitial nephritis, Glomerulonephritis, 
Polycystic kidney disease, Renal tubular acidosis, IgA nephropathy, immune disorders including 

30 cell-mediated immunity disorders, Leukodystrophies, inflammation, Hyperthyroidism, 

Hypothyroidism, Lesch-Nyhan syndrome, Multiple sclerosis, Transplantation, lung diseases, 
including asthma, Emphysema, immundeficiencies, Crohn's disease, Scleroderma, Appendicitis, 
Autoimmune diseases, Systemic hipus erythematosus, developmental disorders, neural tube 
defects, modulation of apoptosis, viral, bacterial, and parasitic infections or other disorders 
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related to cell signal processing and metabolic pathway modulation by administering a test 
compound to a test animal at increased risk for the aforementioned disorders or syndromes. The 
test animal expresses a recombinant polypeptide encoded by a MOLX nucleic acid. Expression 
or activity of MOLX polypeptide is then measured in the test animal, as is expression ot activity 
5 of lbs protein in a control animal which recombinantly-expresses MOLX polypeptide and is not 
at increased risk for the disorder or syndrome. Next, the expression of MOLX polypeptide in 
both the test animal and the control animal is compared. A change in the activity of MOLX 
polypeptide in the test animal relative to the control animal indicates the test compound is a 
modulator of latency of the disorder or syndrome. . 

10 In yet another aspect, the invention includes a method for determining the presence of or 

predisposition to a disease associated with altered levels of a MOLX polypeptide, a MOLX 
nucleic acid, or both, in a subject (e.g., a human subject). The method includes measuring the 
amount of the MOLX polypeptide in a test sample from the subject and comparing the amount of 
the polypeptide in the test sample to the amount of the MOLX polypeptide present in a cunLrul 

15 sample. An alteration in the level of the MOLX polypeptide in the test sample as compared to 
the control sample indicates the presence of or predisposition to a disease in the subject 
Preferably, the predisposition includes, e.g.. diabetes, metabolic disturbances associated with 
obesity, the metabolic syndrome X, anorexia, wasting disorders associated with chronic diseases, 
metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer-associated cachexia, 

20 cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, immune 

disorders, and hematopoietic disorders. Also, the expression levels of the new polypeptides of 
the invention can be used in a method to screen for various cancers as well as to determine the 
stage of cancers. 

In a further aspect, the invention includes a method of treating or preventing a 
25 pathological condition associated with a disorder in a mammal by administering to the subject a 
MOLX polypeptide, a MOLX nucleic acid, or a MOLX-specific antibody to a subject (c.g. , a 
human subject), in an amount sufficient to alleviate or prevent the pathological condition. In 
preferred embodiments, the disorder, includes, e.#, Cancer including pancreatic cancer, 
adenoma, brain tumor, colon cancer breast cancer, prostate cancer, testis cancer, neurological 
30 disorders including age-related disorders, AJrfieimer's disease, Stroke, Parkinson's disease, 

Huntington's disease, Cerebral palsy, Epilepsy, Behavioral disorders, Addiction, Anxiety, Pain, 
nephropathy, neurodegenerative disorders, Aneurysms, Fibromuscular dysplasia, metabolic 
disorders including, failure to thrive, nutritional edema, hypoproteinemia, trypsinogen deficiency 
disease, chronic and heriditary pancreatitis, enterkinase defieciency, Hypercholesterolemia, 
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Obesity, Diabetes, cardiac disorders inclusing tachycardia, erythroderma, long QT syndrome, 
heart block, Ataxia-telangiectasia, Cardiomyopathy, Atherosclerosis, Hypertension, Congenital 
heart defects, Aortic stenosis, Atrial septal defect (ASD), Atrioventricular (A-V) canal defect, 
Ductus arteriosus, Pulmonary stenosis, Subaortic stenosis, Ventricular septal defect (VSD), valve 
5 diseases, Larsen syndrome, night blindness*, brugada syndrome, Von Hippel-Lindau (VHL) 
syndrome, Tuberous sclerosis, Hypercalcemia, Cirrhosis, angiogenesis and wound healing, 
blood pressure regulation, Trauma, Tuberous sclerosis, Fertility, Hirschsprung's disease, Renal 
artery stenosis, Interstitial nephritis, Glomerulonephritis, Polycystic kidney disease, Renal 
tubular acidosis, IgA nephropathy, immune disorders including cell-mediated immunity 

10 disorders, Leukodystrophies, inflammation, Hyperthyroidism, Hypothyroidism, Lesch-Nyhan 
syndrome, Multiple sclerosis, Transplantation, lung diseases, including asthma, Emphysema, 
immundeficieneies, Crohn's disease, Scleroderma, Appendicitis, Autoimmune diseases, Systemic 
lupus erythematosus, developmental disorders, neural tube defects, modulation of apoptosis, 
viral, bacterial, and parasitic infections. 

15 In yet another aspect, the invention can be used in a method to identity the cellular 

receptors and downstream effectors of the invention by any one of a number of techniques 
commonly employed in the art. These include but are not limited to the two-hybrid system, 
affinity purification, co-precipitation with antibodies or other specific-interacting molecules. 
Unless otherwise defined, all technical and scientific terms used herein have the same 

20 meaning as commonly understood by one of ordinary skill in the art to which this invention 

belongs. Although methods and materials similar or equivalent to those described herein can be 
used in the practice or testing of the present invention, suitable methods and materials are 
described below. AJ1 publications, patent applications, patents, and other references mentioned 
herein are incorporated by reference in their entirely. In the case of conflict, the present 

25 specification, iiKrluding definitions, will control In addition, the materials, methods, and 
examples are illustrative only and not intended to be limiting. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

DETAILED DESCRIPTION OF THE INVENTION 

30 The invention is based, in part, upon the discovery of novel nucleic acid sequences that 

encode novel polypeptides. The novel nucleic acids and their encoded polypeptides are referred 
to individually as MOL1, MOL2, MOL3, MOL4, MOL5, MOL6 : MOL7, MOU, MOL9, and 
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MOL10. The nucleic acids, and their encoded polypeptides, are collectively designated herein a s 

The novel MOLX nucleic acids of the invention include the nucleic acids whose 
sequences are provided in Tables 1 A, 2A, 3A, 4A, 5A, 6A, 6E>, 7A, SA, 8D, 9A, 9D, 9F, and 
5 10A. inclusive ('Tables 1 A - 10A")» or a fragment, derivative, analog or homolog thereof The 
novel MOLX proteins of the invention include the protein fragments whose sequences are 
provided in Tables IB, 2B, 3B, 4B 5B, 6B, 6E, 7B, SB, 8E, 9B, 9E, 9G and 10B inclusive 
("Tables IB - lOB**). The individual MOLX nucleic acids and proteins are described below. 
Within the scope of this invention is a method of using these nucleic acids and peptides in the 
1 0 treatment or prevention of a disorder related to cell signaling or metabolic pathway modulation. 



MOU 

A disclosed interleukm- 1 roceptor/Toll-like nucleic acid of 1 050 nucleotides, MOL1 , is 
shown in Table 1 A. The disclosed MOL1 open reading frame ("ORF") begins at the ATG 
initiation codon at nucleotides 1-3, shown in bold in Table 1 A. The encoded polypeptide is 
1 5 alternatively referred to herein as MOL1 or as GMJ79960175. The disclosed MOL1 ORF 

terminates at a TGA codon at nucleotides 3043-3045. As shown in Table 1 A the start and stop 
codons are in bold letters. 



Table 1A* MOL1 nucleotide sequence (SEQ ID NO:l). 

"atgcTggcc^^ 

CTGCAACT GGCTGT T CCTG AAGTCTGT GCCCCACCTCTCCAT GGC ACCCCvTTGGCAATGTCA CC AGCCiTTT CCT 

TGTCCTCCAACCGC&TCCACCACCTC^^ 

TGGAACTGCCC^CGGTTGGCXTCAGCCC^ 

TG TGCCCACCCTGG AAGAGCT AAACClCAGCTACi^C AACATCAT GACTGTGCCTGCGCT GCCCAAATCCCTCAT AT 
CC CT GTCCCTCAGCCAT ACCAACATCC T GATGCT AG ACTC T GCCAGCCT CGCCGGCCTGCATGCCC TGCGCl'TCCT A 
XX CAT SGACGGC AACT GT VAT TAUAAG AACCCC?T GC AGGC AGGCACTG GAGGT GGCCCCGG GTGCCCTCCT T GGCCT 
GGGCAJVCCIX!!ACCCACCTCrTIAOro 

A3 CTSCTGT TGTC CTACAACCGCATCGT C AAACT GGCGCCTGAGGACCT GGC CAATCTGAC CGCCCT GC GT 3TCJCTC 
GATGl 1 GGGC^ tAAATTG CCGCCGCTGCGAC CACGC?CCCAACCCCTGCAT GG AGTGCCCTCGTCACT TCCC C CAGCT 
AC ATCCCGAT ACCTTCAGCC ACCT GAGCC GTCTT GAAGGC CTGGT GTTGAAG 3ACAGT TCT CTCT C CTGGCTGAAT G 
CCAGT TGGT T CCGTGGGCT GGGAAACCT C CGAGT GCT GGAC CTGAGTGAGAACTTCC1 CT ACAAAT GCATCACTAAA 
ACCAAGGCCrTCC AGGGCC T AAC AC AGCTGCGC AAGCTC AACCTG TCC T ? CAATT ACC AAA AGAG GG TG TCCTTTGC 
CCACC TGTCrCTGGCCCCT T CC TT CGGGAGCCTG GT CGCC CTGAAGG AGC T GG AC ATG CAC GGCATCTTCTTCCGCT 
CACTCGATGAGAC CACGC T CC ^CCACTG G CC C G CCT GCCCACGC TCCAGACTCTrGCGTCT GCAGA'J GAACTTCA3 C 
AACCAGGCC^tXTCGGCATCTTOU^CCTTCCCT 

AGCTT CGGA3CTGACAG CCACCAT 3GGGGAGG CAGATGGAGGGGAGAAGG TCTG GCTGCAG C CT GGGGACCTTGCTC 
CGGCC CCAGTGGAC ACT CCCAGCT 2 T GAAGACTTC AG GCC CAACTCCAC CACCCT CAAC TTCACCT tttGAT CTGTCA 
CG GAACAACCTGGTGACCGT GC AG C C GGAGAT GTTT GCCC AGCTCT CG CACCTGCAGTGCCn'GCGCCT G AGCCACAA 
CTGCATCT CGCAG GCAGTC AATGG 2 T CCCAGTTCCT GCCGCTGACCGG T C-GC AG GTGC 3 AG ACCTGTCCC ACAAT A 
AGCTG GACCTCTACCACGAGC ACT CA.T^A(^GAGCT ACCACCAClGCAGCCCCL' G GACCT CAGC TACPACAGCCRG 
CCCTT T GGCATGC AGGGCGT G 3GC CACAACTTC AGCT TCGT GGCT CACCTGC.GC ACCCTGC GCCACCTC AGCCTGGC 
, CC ACA AC AACATCCACAG C CAAGT GT CCCAGCAG CT CTGCAGTAC GTCGCT GCGGGCCCTG GACT TCAG CGGCAATG 
C ACT G GGC C^TATGTGGGCCGAGGGAJGACCTCTATC'J GCACT TCTTCCAAGGCCT-^AGC GGTT7GATCTGG CT GGAC 
TT GTC CCAGAACCG CCTGCACACC CT CCTGCCCCAAACCC TGOGCA ACCTCC CCAAGAG CCTAC AGGT GCT GCGTC I 
; CC GFTG ACAATTACC TGGCCT T CTT T AAGTGGTGGAGCCTC CACTTCCTGCCC AAAC TGGAAGTCCTCGAOC T GGCAG 
j G AAAC CAGCTCAAGG CCC?GACCAATGX3CAGCCTGCC TGC T GGC ACCC G GCT CCG 2AGG CTGGATG'JCAGCT GCAAC 
f AGCAT C AGCT TCGTGGC CCCCGGCTT CTT TTC C AAGGCC^.GSAGCTGCGAGA.GCTCAACCT'?AGCGCC AAC G CCCT 
| CAAGAC AG T GGAC CAC T CCTGGTT TG GGCCCC T GGCC-AGTGCCCTGC AAACAC TAGATGTAAGCGCCAACCC TCTGC 
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ACTGCGCCTGTGG^CGGCCTTTATGGACTTCCTGCTGGAGSTGCAGGCTGCCGTGCCCGGTCTGCCCAGCCGGGTG 
AAGT GV GGCAGTCCGGGCC AGCTC CAGtiGCCTCAG CAT CT TT GCACAGGACCT GI SCCTCTGCn TGGATGAGGCCCT 
CTPCCTGtSGACTGTTTCGCCC^CTCG^ 

GC1 GGGAC CTC TGGTACTGCC 1 TCCACXTTGl'G C Cf GGC C I GGCI 1 CCCTGGCGGGGG^GGCAAAG^GGGCGAGATGAG 
GATG^XXil'GL't^r/CGA^SCClTCG^rL^i'CT T CGAC AAAACGCAGA5CGCAGTGGCAG ACT GGG7GT AC AACGAGCT 

AAACCCT C iTTGAGAACCl'GT GGGCCTCGGT CTATGGCAGCC SCAAGACGCTGTrTSTGCT GGCCCACACGGACCGG 
GT CAGTGGTCT CTTt : CGCGCC AG C T7 CCTGCT GGCCCAGCAGC GCCTGCl'GGAGGACCGCAAGGACGTCGTG GTGC T 
GGTGATCCTGAGCTClGACGGm^^TCCCG^^ 



TTCTATAACCGGAACT TCTGCC&G GG ACCCAC GGCCGAAtAC 



. . A disclosed encoded MOL1 protein has 3,46 amino acid residues, referred to as the . 
MOL1 protein. The M0L1 protein was analyzed for signal peptide prediction and cellular 
localization. SignalP results piredic t that MOL1 is cleaved between position 4 1 and 42 of SEQ 
ID NO:2. Psort and Hydropathy profiles also predict that MOL1 contains a signal peptide and is 
likely to be localized in the plasma membrane (Certainty=0.4600). The disclosed MOL1 
polypeptide sequence is presented in Table IB using the one-letter amino acid code. 



Table IB. Encoded MOL1 protein seqoeuce (SEQ ID NO:2). 

MLAMTliAL GTLPAFLPCE1 HGLV>JC>s T NLFLKS V PH FS MAAPRGN VT SLSLSSNRIHHIiKUS DFAHLPSLHHLNLK 
WNCPPVGJaS PMEjr'FCKK'i'l tli= ST FLAVPTLE ELNL S YNN IMT V2AX PKSL3 S LS LS ffTNILMLDSASLAGLHALRTL 
FMD^C YYKN PCRQAL E VAPGALLGLSNLT KLSL KYNNLT WP RNL PSSLEYLLLS YNR I VKLAPE DZANLT ALRVL 
nVGGNCRRCDHAFU P'CMEC PRHT PQIrfiFDTFSKLS RLEGL VLKDS SLSWLNA5 W FRGliSN LRVL DL5EH FL YKC ITK 
TKAFQGLT QLRKLNL £ FNYQKRV S FAHLSL AF5 FGS LVAL KE LDMHGI FrRSLDET TLtRz LARL PMLCT LRLQMNFI 
NQAQLGI FRAFPGLRYVDL3 CHRI 3GA3EL IATMGEADGGEKaHSILQ PGDLA? AP VDTPSS E DFR PCTCSTLN FT LDL S 
RNNLYT VQ PEMFAQL £ H LQCZi RliS HN2ISCAVNG S QTL PLTG L QVLD1-S HtTKLDL YHE HS PTE L PRLEAL3LS YN SQ 
P FGXQG VG HN FS FVAK I RTLRHL S LAHNN I HS QVS QQLCST5 tiftALD FS GNALG HMWABG D J iY.T iH FFQGI ■ S GLI HL D 
IiS QNRLHTLLPQT I 1RNT1 P KS XiQVT^LRDNYIAFFKWWSL H FLPKLEVLDItAGNQLKALTNGS L P AGTRL RRL DVS CN 
. 5 1 S FVAPG F F5 KAKE LRELNLS ANALKTVDHSK FG PLAS ALU, I LDV5AN PLHCACG AAFM DFLL E VQAAY 7GL P 3RV 
KCGSPGQI^GI^IFAQDLRLCLDEALSWIXIF^ 

DAL FYDAFWFDKTCS AVADWV YNEL RGQLEECRGRWALR^CLEERDKLPGKTL ?ZNLtf ASVYG5 RKTL FVLAH T DR 
VSGLLRAS FLLACjQRLIiE DRKDVWLVT LS P CGRRSHYVRI^QRLCRQ SVLLWPHC PS GCRS FWAQLGMALTRONilii 
frYWiWFCC.GPTAE 



10 MOL1 was initially identified on chromosome 3 with a TblastN analysis of a proprietary 

sequence file for a G-protein coupled receptor probe or homolog, which was run against the 
Genomic Daily Files made available by GenBauk. A proprietary software program (GenScan™) 
was used Lo further predict the nucleic acid sequence and the selection of exons. Ths resulting 
sequences were further modified by means of similarities using BLAST searches. The 

15 sequences were then manually corrected for apparent inconsistencies, thereby obtaining the 
sequences encoding the folWength protein. 

A region of the MOL1 nucleic acid sequence has 690 of 1 203 bases (57 %) identical to a 
Homo sapiens Toll Receptor mRNA (GENBANK-ID: AL1 37451), with an E-value of 5.7xlO" 8 . 
In all BLAST alignments herein, the "E-value ,? or "Expect" value is a numeric indication of the 

20 probability that the aligned sequences could have achieved their similarity to the BLAST query 

sequence by chance alone, within the database uoat was searched. For example, Qie probability 

9 
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that the subject G'Sbjcf) retrieved from the MOL1 BLAST analysis, eg. , the Homo sapiens 
MOL, matched the Query MOL1 sequence purely by chance is 5 .7x10* 

A BLASTX search was performed against public protein databases. The full amino acid 
sequence of the protein of the invention was found to have 342 of 900 amino acid residues (38%) 
5 identical to, and 493 of 900 residues (54%) positive with, the 1049 amino acid residue Toll-like 
Receptor 7 protein from Homo sapiens rptnrSPTREMBL-ACC: AAF60188) . 

The amino acid sequence of MOL1 also had high homology to other proteins as shown m 
""" table 1C. "' 



Table 1C. BLAST results for MOL1 


Gene Index/ 
Identifier 


Protein/ 
Organism 


Length 
(aa) 


Identity 


Positives 
IS) 


3xpect 


Patp:W86365 


DNAX toll-likc 
receptor 
1C CHcano 
sapiens] 


336 


335/336 
(99%) 


335/33S 
(99%) 


0.0 


gi 18394 456 ]ref |NP 
_059138.1| 


toll-like 
receptor 9 
(3ono sapiens] 


1032 


960/1014 
(94%) 


960/1014 
(94%) 


0.0 


gi|136436651ref IX 
P_C03235.2| 


toll- like 
receptor 9 
[HoTTio sapiens 1 


1014 


960/1014 
(94%) 


960/1014 
(94%) 


0.0 


gi|8099654|gb|A?LF 
72190. 1)AF259263 
1 


.toll-like 
receptor 9 form 
B [Homo 
sapiens] 


975 


921/975 
(94%) 


921/975 j 
(94%) 


0.0 


gill35Q71731gh|AA 
K28488.1IAF314224 
1 


toll-like 
receptor 9 ~Kus 
HQS cuius ] 




720/1015 
(70%) 


7S9/101S 
(77%) 


Q.Q 



10 

A ClustalW analysis comparing disclosed proteins of the invention with related OR 
protein sequences is given in Table ID, with MOL1 shown on line 1 , 

In the ClustalW alignment of the MOLl protein, as weD as all other ClustalW analyses 
herein, the black outlined amino acid residues indicate regions of conserved sequence (i.e., 

15 regions that may be required to preserve structural or functional properties), whereas non- 
highlighted amino acid residues are less conserved and can potentially be mutated to a much 
broader extent wilhout altering protein structure or function. Residue differences between any 
MOLX variant sequences herein are written to show the residue in the "a" variant and the residue 
position with respect to the "a" variant. MOL residues in all following sequence alignments that 

20 differ between the individual MOL variants are highlighted with a box and marked with the (o) 
symbol above the variant residue in all alignments herein. 



10 
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Table ID. ClustalW Analysis of MOL1 

3) Novel MOL1 (SEQ TO N0:2> 

2) gi | 83944561 ref |np_059138.1| toll-iifce recep-or 9 IRomo sapiens] ;seq id 
N0:3I) 

3) gn. 11. 1^4 8665 1 ref ]XP_O03236.2] tolX-1 ike receptor 9 [Homo sapiens] {SEQ ID 
NO:32) 

4) gi|B0y9554l3brAAF72190.1|AF2592$3_l toll-like receptor 9 form D [Homo 
sapiens] (SEQ ID KO:33) 

5) gi|13507173]gb|AAK28488.1|AF314221_l toll-like rec-p-or 9 [Mus musculus] 
(SEO ID NO: 34) 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



•proi ■ 

NP_059138. 
3W_003236. 
AF259263 1 



M , 31C22d]j MVLRKHTT.H? LSLLVQAAjvQ 



PROl 

NP D591S0. 
XP~033236. 
AF259263_1 
AP314224 1 



PROl 

KP_059138. 
XP_003236. 
AF259263J. 
AF314224 1 



PROl 

NP_059138. 
XP 003236. 
ZF25Z2S3JL 
AF314224 1 



PROl 

MP_059X38. 
XP_003236 . 
AF259263_1 
AF314224 1 



PROl 

NP 059130, 
XPJ>0323«. 
AF2592S3 1 
JU?314224~1 



PROl 

HP 059138 . 
XP_O0323< . 
AF2S9263_1 
AF314224 1 



PROl 

NPJ)59138 . 
XPJJ03236 . 
AF259263_1 
AP314224 1 




490 



soo 



S10 



520 



530 



540 
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PROl 

HP_0S9138 . 
XBJ>03236 . 
&F2532fi3_l 
&F314224 1 



10 



15 



20 



25 



30 



35 



40 



45 




PROl 

HP 055138, 
XPJJ03236 . 
AF2592S3 1 
AF314224~1 



PROl 

HP_0 59138. 
XP 003235. 
AT259263_1 
AF314224 1 



PROl 

WP_0 59138. 
XP_0 03236. 
ara59263_l 
AP314224 1 



50 



55 



PROl 

NP_059138. 
XP_003236. 
AF2S92€3_1 
AP314224 1 



850 



860 




890 



909 
■ - I 



N 


bIL.L-1COI.EECF 


881 




ELRGQLEECF 


699 




KliKGCfLEECH 


881 






842 


N 




859 



60 



65 



70 



75 



310 



PROl 

KP_059138 . 
XP_O03236. 
&F259263_1 
AF314224 1 



PKOl 

NP_05913B. 
XPJJ03236. 
AP259263_1 
AP314224 1 



PROl 

NP 059138. 



920 



930 




J. 



940 

. . 7 . 



950 
• J 



960 



970 







'VLV 


EDRKDV 


'VLV 


EDRKDV* 


'VLV 


tiORJ&DV 


fVLV 




'VLV 



1030 
I 



3RCTLFVLA-1 
SRKPLFVLAH 
SEKPLFVIAH 
SRRTLFVIAH 



900 
■ I 




. I . 



950 
..I . 



ifVfi7.RQP.LC5 
"iVRLttCtliCj? 
ZVRLRQRI.C? 

OTP.LRORIC? 




1014 
1032 
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XPJ103236. fflxSN^Eg 1014 
AP25»263_1 B^B3 B * 75 
W31422-l_l gfCQN^Bl ES3 1032 



5 

The interleukin-1 (IL-1) receptor/Toll-like receptor (TLR) superfamily is a recently 
defined and expanding group of receptors that participate in host responses to injury and 
infection. The superfamily is defined by the ToML-1 receptor (TIR) domain, which occurs in 
the cytosolic region of family members, and is further subdivided into two groups based on 

10 homology to either the Type I IL-l receptor or Drosophila Toll receptor extracellular domain. 
The former group includes the receptor for the important Thl cytokine IL-1 8, and T1/ST2, 
which may have a role in Th2 cell function. The latter group includes six mammalian TLRs, 
including TLR2 and TLR4, that largely mediate the host response lo gram-positive and gram- 
negative bacteria, respectively. Whether bacterial products are actual Hgands for TLRs, or 

15 whether they generate ligands via as yet unidentified pattern recognition receptors, has yet to be 
determined. Signaling pathways activated via the TIR domain trigger the activation of 
downstream kinases, and transcription factors such as NF-kappaB, and involve the adaptor 
protein MyD88, which itself contains a TJR domain. 

As our primary interface with the environment, the skin is constantly subjected to injury 

20 and invasion by pathogens. The fundamental force driving the evolution of the immune system 
has been the need to protect file host against overwhelming infection. The ability of T and B cells 
to recombine antigen receptor genes during development provides an efficient, flexible, and 
powerful immune system with nearly unlimited specificity for antigen. The capacity to expand 
subsets of antigen-specific lymphocytes that become activated by environmental antigens 

25 (memory response) is termed "acquired" immunity. Immunologic memory, although a 

fundamental aspect of mammalian biology, is a relatively recent evolutionary event that permits 
organisms to live for years to decades. "Innate" immunity, mediated by genes that remain in 
germ line conformation and encode for proteins that recognize conserved structural patterns on 
microorganisms, is a much more ancient system of host defense. Defensins and other 

30 antimicrobial peptides, complement and opsonins, and endocytic receptors are all considered 
components of the innate immune system. None of these, however, are signal-transducing 
receptors. Most recently, a large family of cell surface receptors that mediate signaling through 
the NF-kappaB transcription factor has been identified. This family of proteins shares striking 
homology with plant and Drosophila genes that mediate innate immunity. In mammals, this 

35 family includes the type I interleukin-1 receptor, the interleukin-1 8 receptor, and a growing 
family of Toll-like receptors, two of which were recently identified as signal-transducing 
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receptors for bacterial endotoxin. In this review, we discuss how interleukin-1 links Oie innate 
and acquired immune systems to provide synergistic host defense activities in skin. 

In Drosophila the Toll protein is involved in establishment of doTso-vcntral polarity in 
the embryo. In addition, members of the Toll family play a key role in innate antibacterial and 
5 antifungal immunity in insects as well as in mammals. These proteins are type-I transmembrane 
receptors that share an intracellular 200 residue domain with the interleukin-1 receptor (IL-1R), 
the Toll/EL-IR homologous region (TIR). The similarity between Toll-like receptors (LTRs) and 
IL-1R is not restricted to sequence homology since these proteins also share a similar signaling 
pathway. They both induce the activation of a Rel type transcription factor via an adaptor protein 

10 and a protein kinase. Interestingly, MyD88, a cytoplasmic adaptor protein found in mammals, 
contains a HR domain associated to a DEATH domain (see IPR000488 ). Besides the 
mammalian and Drosophila proteins, a TFR domain is also found in a number of plant proteins 
implicated in host defense. As MyD88, these proteins are cytoplasmic. Site directed mutagenesis 
and deletion analysis have shown that the TIR domain is essential for Toll anil IL-1R activities. 

1 5 Sequence analysis have revealed the presence of three highly conserved regions among the 
different members of the family: box 1 (FDAFISY), box 2 (GYEXC-RD-PG), and box 3 (a 
conserved W surrounded by basic residues). It has been proposed that boxes 1 and 2 are involved 
in the binding of proteins involved in signaling, whereas box 3 is primarily involved in direc ting 
localization of receptor, perhaps through interactions with cytoskeletal elements 

20 Toll is a Drosophila gene essential for ontogenesis and antimicrobial resistance. Several 

hortologues of Toll have been identified and cloned m vertebrates, namely Toll-like receptors 
(TLR). Human TLR are a growing family of molecules involved in innate immunity, TLR are 
structurally characterized by a cytoplasmic Toll/interleukin-lR (TIR) domain and by 
extracellular leucine-rich repeats. TLR characterized so far activate the MyD88/IRAK. signaling 

25 cascade, which bifurcates and leads to NF-kappaB and c-Jun/ATF2/TCF activation. Genetic, 
gene transfer, and dominant-negative approaches have involved TLR family members (TLR2 
and TLR4) in lipopolysaccharide recognition and signaling. Accumulating evidence suggests 
that some TLR molecules are also involved in signaling receptor complexes that recognize 
components of gram-positive bacteria and mycobacteria. However, the definitive role of other 

30 TLR is still lacking. A systematic approach has been used to determine whether different human 
leukocyte populations selectively or specifically expressed TLRmRNA. Based on expression 
pattern, TLR can be classified as ubiquitous (TLR1), restricted (TLR2, TLR4, andTLRS), and 
specific (TLR3). Expression and regulation of distinct though overlapping figand recognition 
patterns may underlie the existence of a numerous, seemingly redundant, TLR family. 
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Alternately, the expression of a TLR in a single cell type may indicate a specific role for this 
molecule in a restricted setting. 

The arnino acids differences between the three MOLL proteins are shown in Table 1H. 
Deletions are marked by a delta (A). The differences between the three proteins appear to be 
5 localized to a few distinct regions. Thus* these proteins may have similar functions, such as 
serving as olfactory ox chemokine receptors. 
Uses of the Compositions of the Invention 

The above defined information for this invention suggests that this Toll Receptor-like 
protein may function as a member of a "Toll Receptor family". Therefore, ihc novel nucleic 

10 acids and proteins identified here may be useful in potential therapeutic applications implicated 
in (but not limited to) various pathologies and disorders as indicated below. The potential 
therapeutic applications for this invention include, but arc not limited to: protein therapeutic, 
small molecule drag target, antibody target (therapeutic, diagnostic, drug taigeting/cy toloxic 
antibody), diagnostic and'or prognostic marker, gene therapy (gene delivery/gene ablation), 

15 research tools, tissue regeneration in vivo and m vitro of all tissues and cell types composing (but 
not limited to) those defined here. 

The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in pancreatic cancer, adenoma, and other cancers, Larsen syndrome, 
tachycardia, erythroderma, night bKndness, long QT syndrome, brugada syndrome, heart block, 

20 cell-mediated immunity, and applications as a mediator in inflammation and/or other pathologies 
and disorders. For example, a cDNA encoding the Toll Receptor-like protein may be useful in 
gene therapy, and the Toll Receptor-like protein may be useful when administered to a subject m 
need thereof. By way of nominating example, the compositions of the present invention will 
have efficacy for treatment of patients suffering from pancreatic cancer, adenoma, and other 

25 cancers, Larsen syndrome, tachycardia, erythroderma, night blindness, long QT syndrome, 
brugada syndrome, heart block, cell-mediated immunity, and applications as a mediator in 
inflammation. The novel nucleic acid encoding Toll Receptor-like protein, and the Toll 
Receptor-like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be assessed. 

30 These materials are fiirther useful in the generation of antibodies that bind rmmxmospecificalry to 
the novel substances of the invention for use in therapeutic or diagnostic methods. 
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MOL2 

An additional murine GNC2 elFK -like protein of the invention, referred to herein as 
MOL2, is an Olfactory Receptor COR")-like protein. The novel nucleic acid of 4989 
nucleotides, (20466828_RXT1 3 SEQ ID NO:3) encoding a novel GNC2 elFK-lilce protein is 
5 shown in Table 2A. 



Table 2A. MOL2 Nucleotide Sequence (SEQ ID NO:3) 

"mg^igggggccgtggggccccc^g 

gct acaggcc ctg3aggc c ai?ct acggcgcggacttc caag acct 3cg3c cggacgct t gcggaccggt t aaggt ca 
aagag ccccc tglaaatcaatttagttttot accc^^ t ggatt tg 

AGGGT T AAAT GCCCACCi'ACCTATCC^VTtTriaGl'l'C CTG ArJ^TASAGTIi^iA/ATGCCA^GGTCT AT CAAAT GA 
AAGTCTCAAT TTGTTAAAATCT CGCCTAGAAC3>ACTGGCC AAGAA ACACTGT GGGGAGGTAGTGAT GATCT TT GAAC 
ZX3<OTTACCACGTGCACTCATTTC^ 
AG&CGGGC^AAIiWHGAGCAAa^^ 

TGAGATTOAG AGA AGGAA AGAAGAGA T AAAAGA Af- AG A A A A AAGh? AA AG A A A TGGCTA A3C A GGA ACG"T TGG A AA 
TT GCT AGT T T GTC AAACC AAGAT CAT ACCTCT AAGAA GGACCCAGGAGGACAC AGAAC GGZTGCCATT CT A CAT GGA 
GGCTCTCCTX3ACTT1'G:!AGGAAATGGT^^^ 

GT ATTGTGTATGT AATAGT GAAGATT C!TCCTGGCT CT TGTGAAAT TCTGTATTTCAATAT 3GGGAGTCCT GATC AGC 
TCATG GTGCACAAAGGGAAATGTATT GGCAGTGAT GAAC AACTI GGAAAATTAGTC T ACAATGCTTTGGAAAC AGCC 
AC'l GG TGGCTTl'G'l'CTTGT T GTATGAGl'GGGTC CI TC AGT GGCAG AAAAAAAJ GGGT C CAT TCCT TACC AGT C AAGA 
AAA_AGAGAAGATT SATAAG TCC AA AAAGCAGAT T CAA GG AACAGAAAC AG AAT T CA AC T CACT GGT AAA AT TG AGCC 
ATCCTJJlTGTAGrTUZGCTACCTTGCAArGAATTC^ 
CACATXAG*i'GGGGTCTCTCl"rG<X'GC^^ 

A3CTCAGCTCCTGT CJAGGCCTTGATTATCTGCACAGCAATT CTGTGGTGCATAAGGTCCTSAGTtSCAT CTAAT GTCT 
T^GGATGCAGAAGGCRCCGTCAAGATTACGGACTAIAGCATTTC 

GT GT 1 TGAGCAAACCCGAGTT CGT T TT AGTGACAATGCTCT GCC T T AT AAAACGGG GAAG AAAGGAGAT GTT T GGCG 
TCTTGGC CT T CTGC TGCTGTCCC*? CA GCCAAGGACAGGAAT GTGGAGAGTACCCTGTGAC^AT CCCTAG7G ACT T AC 
CAGC1 GACTT TCAAGATTTTCTAAAG AAG AGATG I GT GTGCTTG^ITX^CAAGG AAAGAT 3GAGTCCCCAGCAGTT3 
T T GAAACACAG CTT T ATAAAT C CCCAG CCAAAAAT GCCTCT AGT GGAACAAAGTCCT G AAI CT GAAGGAC AAGAT TA 
TGT TGAGACT GTTATTCCT AGCAACC GGCTACCC AGTGCT GCCT T CTTTAGTG AGAC AC AGAG AC AGT T V T CCCGAT 
ACTT CATTGAGTTTGAAGAAT T ACAACTT CT TGGTAAAGG AGCTTT TGGAGCTGTCATC7 lAGGT GC AGAACAAGTT 3 
GACGS^TCXTTGCTACGCAGTCAAGCGCATCCC^ 

GACAC TGCT5T C ACGGCT GC ACCATGAGAACATTGT GCGCT ACTACAACGCCT GX^TCGAGCGGCACGAGCGGCCGG 
OaSGACCGGGGACGCCGCCCXXC-GACTC^ 

GACAC AGACGGCCTGGACAGC GT AGAGGC CG CCGCGC CGC C ACCC AT CCTCAG CAG CT CGG T GG AG TGG A.GC ACTTC 
GGGCGAGCG CT CGG CCAGTGGCCGTTTCCUXGCCACCGGCC C GG3 CTCCAGCGATGACGAGGACGACGACGAGGAC3 
A3CA£GGT<JGCGTCTT;CTCCCAGTCCTTCCTt3C^ 
GAGAJ>jCACTAAAAG1X^AATCAGGATG^ 

GACTS ACGCT GTG C ACTACCT AT AC ATCC AG ATGGAG T ACT G T GAGAAGAGCACTT TACGAGAC ACCATT GA^ C AGG 
GACTGIATC3A.GACAOCGT CAGACT CTGGAGGCTTT TTCG AGAGAT T CTGGATGGATT AGCTTAT ATCCA,TG AGAAA 
GGAAT GATT CACCGSGAT T TX5AAGCCT GTCAACATT TTTTT GGATTCT GATGACCATGTGAAAAT AGGTGA7 T T TG3 
TTTGGCGAC AG ACC ATCT AGCCTTTT C TGCTGACAGCAAAC AAGACG^ TAAGTCAGACC 
CTTCAGGTCACT TAACTGGGATGGTTGGCACrGC'l CT CTAT GTAAGCCCftGAGGl'CC AAG3AAG CACC AAATCT GCA 
T ACAACCAGAAAGT GGATCTCTTCAG CCTGGGAATTATCTTCTTTGAGATGTCCT AT C ACCCC ATC-GTCA.CGGCTTC 
ASAJ*AGGATCTTTOT^raCAACCAACTCA^ 

AI C CAPA.GCAGAAATCAG TCAT CT CC TG^TGTTGAAC CACGAT CCAGCAAAACGGCCCACAGCCACAGAACT GCT Z 
AAG AGTG AG CTCCT 3CCCCCACCCCAGAT GGAG^ AGTCAG AGCTGC ATGAAGTGCTGC ACC ACA.CGCT GACCAACGT 
GGATGGGAAGGC^TACCGCACCATGATGGCCCAGAT CTTC T CGC AGCGCATCTCCCCT GCCATCGATTACAC CT ATS 
AC AGCGAC AT AClX]iAAGCGCAA(^T CT CAATCCG 1ACAGC C AAG/iT GC AGCAGCAT G T GTG TCAAACC AT CAT CCG C 
ATCT I TAAAAGACATGGTGOTGT^CAGTTGTGTACTCC ACT ACT 
C^AAGCTGCCCPATTCATGGACCACAG^ 

AT GT GG CAAGAAAT AAT;ATATT GAAX T TAAAACGG - ACTGCATAGAACGT GTGTTC ACGC CGCGCAAGT - AGAT CG A 
TT T CAT CCCAAAGAACTT CT GGAGTGT GCCTTT GA7A T T GTCACTTCT AC CACCAACAGCT TTCTGCC CACT GCTGA 
AATTATCTACACrATCTATGAAAT^IATCC 

AT ACCAT GT T ATTGAAAGCAATACTCTTACAC'l'GTGGG AT CCCAGAJ\GAT2^ k ClCAGTCAAX; 1" CI ACATTAT T CT3 
T ATGATG CTGTGAC AGAG AAGCTGAC GAGGAGAGAAGTGGAAGCT AAATTT TGT AATCTGTCT T TGTCTTCTAAT A3 
TCTGTGT CGAC TCT ACAAGTT T ATTGAACAGAA3 GGA3ATT T GC AAGATCT^ATX^CAJ^AATAJ^ATTCA?^ AATA3 
AAC AGAAAAC AGGT ATTG C AC AGTTG G TGAAGT AT GG C TT AAJ^VG ACCTAGAGG AGGT TX^T TG G ACTG T TG AAGAAA 
CTCGGC ATCAAGTT ACAG GT TTGGGT CTTGATC AA7TT G GGCT^GGTTTACAAGGT GC AGCAGCACAATG GAATCAT 
CTTCCAG TT TG TGGCTAT C ATC AAAC GAAGGCAAP-GG GCTGT ACCTGAAATCCTCG CAGCI GGAGGCAGATATG ACC 
TGCT G ATTC CCCAGTT^AGAGGGCXIAC AAGCTCTG GGQCClvGTTC CCACTGCCATTG GGGT CAGCATAG CTA TA.GA2 
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! AAGAT AT C T GCTSCTGT CCT C AACAT GGAGGA AT CTGT A A GT T CTGTT ACAAT AGGCTCT GGGG AC CT CCTGGTTGT 
AAGTGTGGGC CAGATGTC ?AT GTCCAGGGCCAT AAACCTAACCCAGAPACTCTGG ACAGC^ 
T CATGT ACGACT-SCrrCAC^iGTT TCAGTCCCAAGAGGAAT T ACAAJGAGT ACTG C AGACATCATGAAATC ACGTATGT G 
C-CCCTTGT CT CGGATA A AGAAGGAAGCCA1 GT CAAGG TT AAG T ZT TTC GAG AAGGAAAGGC AGACAGAGAAGCGTGT 
GC TGGAG ACT GAACTT G T GGACCAJ GTAd GC AGAAACT GAG GACTAAA.GT C ACT GATGAAAGGAATT TT AG AGAAG 
C TT C CGAT AA T CTTGC AG TGCAAAAT CT SAAG G G G T C ATT T TCT AATGCT TCAGGTf TCTTTGAAATCC ATGGAGC A 
P£AGTGGTTCCCAT^TGT GAGTGTGCT AGCCCC GGAGAAGC I GTC AG^ AGCACTAGGAGGCGCT ATGAAACTCAGGT 
AC AAACTCGA.CTT CAG ACCT CCCT T GCCAACTT ACA ± CAG AAAAGCAGT G AAATT SAAATT CCGGCTGT AG TGGAT C 
1 ACCC AAAGAAA2AAT AT T ACAGT T T TTATCATT AGAGT GGGAT GCTG ATGAACAGGCAT T TAACACAAC TGTGAAG 
C AGCTGCT GT CACGCCTGCCAAAGCAAASATAC C T CAAAT TAGI CTGT GA TGAAATT T ATAACAT CAAAGT AGAAAA 

i AAAGGTGT CT GTGCT ATT TCTGTAC AGCTATAGAGATGAC T ACT ACAGAA TCT TAT T TTAA 



An open reading frame (ORF) for MOL2 was identified from nucleotides 1 to 4986. The 
disclosed MOL2 polypeptide (SEQ ID NO:4) encoded by SEQ ID NO:3 is 1662 amino acid 
residues, has a molecular weight of 1 88250. 1 and is presented using the one-letter code in Table 
5 2B. The SignalP, Psort and or Hydropathy profile of MOL2 indicate that this sequence does not 
have a signal peptide and is likely to be localized to the nucleus. Therefore it is likely that 
MOL2 is available at the appropriate sub-cellular localization and hence accessible for the 
therapeutic uses described in this application. 



Table 2B« Encoded MOL2 protein sequence (SEQ ID NO:4)» 

HAGGRGAPGRGRDE PFE S Y PQRQ CH2LQALEAI YGADFQDLP ^DACGPVKV^P PEIN1-VLYPOGLTGE&VYVKVDL 
RVKCPPTYPDWPE IBI*KMAIC3LQK15SVNLLK3RLBC LAKKTCGEWMI FELftYIIVQSFLflEI INKFFFKC F1IEEKLD 
RRAQE 2QQRLLEAQAEKPAQQREI LHEIQRRKE E I KEE KKRKEHAKQERLEI ASLSNQDHTSKKDPGGKRTAAI LHG 
GS ? DFVGNG KHRAN SSGR S RLRRERQY3VCNSE D3 PG SCB I LYFWHG5 ? DQLMVHKGKC 1 G S DEQ liGKLV YHAL2 TA 
T GGFVTJjYEWVTiQWQKKMG PFTiT SQE KKKT DKCKKQI QG7ETE FW 3 LVKTio HFK WR YLAMN 3 K 3 KTT P FWD I "LVY. 
HISGVSLAAHLSHS G PI P VHQLRR YT AQLL S GIiD YLH 3 NS WHK VTj S AS NVLVDAE GT VKI T DYS ISKRLADICKED 
YFBQT aVRFS DNAL PYKT GKKGDWRLGLL LL3L3QG QBCC-Z- YD VT I PS DLPAD FQD FLKKRCVCLD DKERWS PCQL 
TjKHSFIKFGBKMPLVEQSPESEGCDY^TSTVIPSNRLPSAAFl?^ 

DGCC YAVKR I P INP AS RQ FRRI KGEVT LI S RLHHZNI VRYYNAWI SRHE^PAG PGTP P PD5G PLAIODRAARGQ PAS 
DTK3LDSVE?J\APPPILSSSVEWSTSGERSASAR?PA^ 

EK SK3 QKQDE DCNE KNG CHESEPS VT TEAVH YLY IQMEYCE3 ST LR DT I DQ3LYRDT VRLftRLFRE ILDGLAY 1 3 EK 
GM Di R3LKFVN I Fh DS DDH VKI G C FGLATDH IAFSADSKQD DQT G Xil KS DPSGHLTGMV GT ALYVS PEVQGS TXSA 
YNQKV:>IJSLGIIFFl^YHPM\nASBRIF^ 

KS EJ ifr? ? BQME E 5 R LKE VTjHHTI.TK'VnGKA YRTMHAQr ESQ R IS PAJTDYTYDSDIT.KGNFS rSTAKMQQHVCBTI TR 
I FKRHGAVC LCT PL LLP SO^RQT YEfiNE AAL FKDH 5©IL VML P FDLRVPFARYVARNN I LIELKRYC I ERVF TPPvKLDR 
FEPKELLECAFDIVTSTTS SFLDTAE IIYT I YE I IQEF?ALQERMYSI YLNHTMIiKAIlLHCGI PEDKLSQVYI 
YDAVT3 KLT RREVEAK FCNLSLS SNS LCRL YK F 1 2 QKGCLQ DIjMPT INS L I EQKTG I ACLVKYGLKDLEEVVGLLKK 
L3 tKLQVWVLINLGLYYKVQQflNGII FQFVAIIKRRQRAVPE XLAAGGRY/DLLI PQFRGPQALGPVPTAIGVS IAT D 
KISAAVLNMEESVSSVTIGSGDLLWSYGOM^ 

alvsd:<egshvkvk$fekerqtej<rvletelvdhvlqvj^t:ot 

twpr^svijijeklsastrrryetqvqtr^ 5eiei lavvdlpketil qfl s lew dade qafwt tv k 

QLZiS RLPKQRYLKLVCDS I YN1 KVE KKVS VLFL Y3 YR3CYY5 ILF 



The MOO nucleic acid sequence has 3 1 1 9 of 3 723 bases (83 %) identical to a Mus 
musculus GCN2 ETF2alpha kinase mRNA (GENBANK-ID: MMU243533|acc:AJ243533 ) 

The full amino acid sequence of the protein of the invention was found to have 479 of 
1662 bases (88 %) amino acid residues (88 %) identical to, and 1 554 of 1662 residues (93 %) 
similar to, the 1648 amino acid residue CAB58363 GCN2 EIF2alpha kinase protein from Mus 
musculus (ptnn TREMBLNEW-ACC: CAB58363 ). 

Other BLAST results including the sequences used for ClustalW analysis aTe presented in 
Table 2C, 
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Table 2C BLAST results for MOL2 



Gene Index/ 
Identifier 


Protein/ 
Organism 


1 Lengtli 
iaa) 


Identity 
(%) 


Positives 
(%} 


Expect 


?atP:B65663 


Pro- sin 
kinase [Homo 
sapiens] 


1649 


1626/1662 
(97%) 


1632/1662 
{90%) 




PatP:B43581 


Cancer 


604 


592/609 


534/609 


6.5e- 




as subletted 
pro~ sin (Homo 
sapiens} 




(97*) 


(57%> 


306 


PatP:B427 61 


ORT252S 


619 


585/624 


596/624 


1.3e- 




polypeptide 




(93*) 


(95%) 


300 


gi 1 10764165 *. gb | AAG2 
2591.1] (All 93344) 


[Kus 

muse, i lus] 


1 lb/0 


1313/1577 
(83$) 


1375/1577 
(36%) 


0.0 


gU1136C320!pir||r4 


pcobStfcle 
translation 
initiation 
factor eiz- 
2 alpha kinase 
(EC 2.7.1.-) 
[similarity] 
- human 
( fragment) 


938 


867/946 
(93%) 


859/946 
(93%) 


0.0 


gi|730S017|ref IKP 0 
38747.11 


GCN2 

eIF2 alpha 
kinase [Mus 
musculus] 


1648 


1374/1647 
(83%) 


1442/1647 
(37%) 


0.0 


gi|l0764161|gb|AAG2 
25B9.1I 


GCN2alpIui 
[Kus 

musculua] 


1370 


1189/1331 
(86*) 


1241/1381 
(59%) 


0.0 


gi 1 724 3057 I do} I3AA9 
2576.11 (AB037759) 


KIAA1338 
protein [Homo 
sapisns] 


1195 


1377/.460 
(94*) 


1380/1460 
(94%) 


0.0 


gillU764163.gb|AAG2 
2590.11 (AT193343) 


GCN2beta [Mus 
musculus] 


1648 


1373/1647 
(83%) 


1440/1647 
(57%) 


0.0 


gi 16065914 I emb] CABS 
8360.11 (AJ243420) 


pu-ative eIF2 
alpha kinase 
[Bono 

sapions] , 


548 


505/556 
(90*) 


507/556 
(90%) 


0.0 



This information is presented graphically in the multiple sequence alignment given in 
5 Table 2D (with MOL2 being shown on line 1) aa a ClusialW analysis comparing MOL2 with 
related protein sequences. 



Table ID. Information for the ClustalW proteins: 

i: Novel M0L2 (S2Q ID NO: 4) 
10 2; gi 1 107 64 1 65 1 gb 1AAG22591.1I (A7193344) GCN2gamioa [Mus ruscuLus] (SEQ ID 

NO: 35) ,. 

3) gi|1136032Olpir|lT46S24 probablo translation initiation factor cIF- 
2alpna kinase [EC 2.7.1.-) [siriilarityj - human (fragment) (GEQ ID MG:36) 
4} gij 7305017 !ref I NPJ)38747 .1 i GCN2 eIF2alpha kinase [Mus musculus] (SEQ 
15 ID NO:37) 

5; gif7243057|dbj{BAA92576,l| (AB037759) KJAA1330 protein [Homo sapiens] 
(SEQ ID NO: 38 !• 

6: gi|1076<163|gblAAG22S90.1l (AT193343) GCN2beta [Kus nusculusj (SEQ ID 
NO:39) 
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7) g± 1 6065 914 I emb I CAB58360.il (AJ243428) putative c!F2 alpha kinaae [Home 
sapiens] (£EQ ID NO: 40) 



10 20 30 40 50 GC 

....|....| ....|....| ....|....| .. . . | . | ...-|....| 

MDL2 Prote MAGGRGAPGR SRDZPPESYP QRQDHELQAL EAIYGADFC.D iRPDACG^VK VKEPPEINLV £0 

AF193344 MRTQRAL ? 

NP_038747. MAG GRG ASCI* GRAEPQE3Y& URyuHJiLQAL EAIiGSUbgC U.U?L)ARG — R VWiUl/EINLV 58 

LLL 9 

AW 93343 fcCAGGRGASGR GRA3PQE5YS QRQDHELQAL fcAIYGSDFgD LWOARG--R VfcUH??EINLV 58 



MDL2 Prote 
AFl 93344 
T46924 
NPJ>38747, 
AAG22589 . 1 
RB0 37759 
ATI 93343 
AJ2 43428 



HOL2 Prot© 
AF193344 
P46924 
NP_C3B747 . 
AAG22 589.1 
A3037759 
API 93343 
AJ2 43428 



XOL2 Pxotc 
API 93344 
T46924 
NP_038747. 
AAG22599.1 
AB0377S9 
API 93343 
AJ243428 



M0L2 Prote 

AF193344 

T46924 

HP_038747 . 

AAG22589.1 

AB037759 

AF193343 

AJ243428 



MOL2 Prote 

AP193344 

T46924 

)IP 038747. 

AAG22589.1 

AB0377S9 

AF193343 

AJ243428 



MDL2 Prote 
I API 9334 4 
146924 
NP_038747 . 
AAG22589.1 
A3037759 
AT193343 



73 80 90 100 

....|....| 1 I ....}....! . 1 

tYPOGlTGEI VYVKVDLBVK CPPTYPDWp EIBI&HAKGL SNKSV^I 
, LVP EXECKHARGL SN^aVN) 



11 D 



120 



RLEEZAKKHC 
HLEEjftKKQC 



IYPQGLAGEB VTVQVELCVK CPPTYPDVYP EIELKNAKGL S^S^XjJjKg HLEELAKKQC 

FFPTCtQ 

EYPCGLAGE2 VYVQVELQVK CPFTYP^WP EIBLKNAKGI. SKpS^jxj 




120 

40 

1 

118 
1 

18 

118 

L 



130 240 
I ....].. ,.| 
GEVVKITELA YHVQSFIiSEH 
G-EVKEFEIA SJ-IVQSFLSILH K! 



180 



150 160 170 

I.... I ....|....| I .... I ....|....| 

K3FHE BMLERPAGBE QQRUi'AQAE RRAQCREIlB 
KSFHR RMT.KRQAGEK QQRT.T.2ARRK EEOJSORKttJ 



G-EVMIFELA HH70SFLSL13 NKjajPKSFllE EMLERGAQEK QQRLLjIARRX ESQEQHELLfi] 



G-EVHIFBLA H1T/OSF16EU 



•XSFHE EMttEtlQAQEK Q0RLL2ARRK EEQEQEE! 



180 

39 

1 

177 
1 

23 

177 

1 




iai*a»aaMreg ««aiPM TESfe agg&QRg] E BBhkWi| l»MJdiWi«y j| 237 



EI'^P.EKtitllKBE EKKRKEKAfc 
EI EREKEEI k|e EK KRKEKAK 



250 



1 
83 
237 
1 
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M0L2 Prcte 

AFl 93344 

T46924 

MP 03B747, 

AAG22589 . 1 

AB037759 

AF193343 



MOL2 ProU 
AF1 93344 
T4G924 
KP 038747, 
AAG22589 . 1 
AB037759 
API 93343 
AJ24342B 



M0I|2 Pro fee 

AF193344 

T46924 

NP_03B747. 

A4Q22589.1 

AB037759 

an 93343 

AJ24342B 



MOL2 Prore 

AF193344 

T46924 

NP 038747. 

AAG22 589.1 

AB037759 

AF193343 

AJ243428 



M3L2 Prote 
AF193344 
T46924 
NP_038747 , 
AAG22589.1 
AB037759 
Ml 53343 
AJ243428 



MOL2 Proto 
API 93344 
T46924 
NP 038747. 
&&G225B9.1 
A3037759 
API 93343 
AJ2 43428 



M0L2 Prote 

API 93344 

T46924 

HP 038747. 

AA622589.2 

A8037759 

AP193343 

AJ243428 




450 

■ I 



460 
1 



470 



480 



EGTVTC.IT D Y3BIS ERLADICK 
EGTVKIT D YsBlS KEL ADICfc 
EGTVKITDYsllSEEIiADICt 
EG I V XI T DYsI I S KRL AD I C [ : 




490 
..I . 



500 



510 



ECSEYPVTIP 




520 

-) 



530 







3?* 




















5DLEADFQDE 
riDLEftni?QDF 
SDLPADFQDF 
SDLPHriFQnF 




LKK 
LKi 
TjKI- 




C7CLDD 
CVCI.DE 
CVCLDD 

cvcr.nn 



540 



EERWSPQ^LL 
KRUWSPQ0T.T. 





RYFTEEEGLQ 
KYFIfelFiiKLO 



KIIS FISpQPKKPLVEQS PE D|3G3Q 


DYgjjETV IEdggLPCrta FF&ETCaQFCfcy FIEFS 


ELO 


EHE FIjJpnPFK?LVE03 PElJsGSQ 


DYHETV IPsRlPSAA FFSETC SqFsRp. YFIEFE 


ELD 


EH£ FiSpQPfISpLVEQS PED13GSQ 


DYWETV IP^BjLPSAA FF5ETc3oF3K.YPIEFE 


ELQ 


KH £ Fi"p0PfB ?L VE0S DE l|sC ZQ 


DYgETV IDsB?LPSAi\ FF&ETCj@C'Fs|rYFIEFE 







LLGKCAFGAV 
LL j KGAFG A V 

LLSKGAFGAV 





FPJlI^Jii'V i'LLSRLH 




PASFB 


ffjukIqevtllsrih 


|HeSiVRY"J 




F£RX t J '_- E I'LL 5 H1K 


IheHivky A 


?ASPg 


FRRHsltEVTLLBElH 





540 
455 
1 

533 
255 
330 
533 
1 



599 
515 
1 

$93 
325 
440 
593 
1 



575 
1 

653 
375 
500 
653 




vrr e RF.F.P p a wgt p e p c 
n IERHBRPaHpGTP PP U 
KlERHtSEPaHeGTP PPO 
WTFRHFRP^JJpGTPFPEj 





• 1 . 



610 



€20 
• I 



830 



'/H.YLYIQHE* 
VHYIA'IQMEY 
7HYLYIQWEY 
VHtfLYIQMEY 
VHYWflQMEY 
WiYLYIQMEY 
7HYXYIQMEY 




•I. 



B40 
..I 



LWELF.REILC 
LWRLFREILC 
LWRLFREILC 
LWRLFREILE 
LWBLFREILD 
LHRLFKBILE 
T,t'«Rl,FRElLL- 



839 
T54 
123 
832 
554 
680 
832 
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.1. 



37 C 
..I 



FLDSDKiVKl 

fldsddhvk: 

PLDSDDHVIC 
BXDSDDHVXI 
FLDSnDHVXl 
i'LLk*OOHV_\L 
FLDSDDHV3Z 




910 



920 

J 



33C 



530 
. J 



950 
.1 



960 




MOL2 Prote 

AF193344 

¥46924 

HP 038747. 

AAG225B9.X 

&B037759 

API 93343 

&J243428 



MOL2 Prote 
AFX 93344 
746924 
KPJJ38747 . 
AAG225B9 . 1 
AB037759 
AF193343 
AJ243428 



MDL2 Pro-be 

AF193344 

T46924 

HP 038747. 

AB037759 
AF193343 
AJ243428 



MDL2 Prote 

&F193344 

T46924 

NP 039747. 

AAG22589 . 1 

»B037759 

AF193343 

AJ243428 




1310 
..| 



1020 
• •I 



e:llk3EL1?s 




PCjMGGSELHE 


1318 






PQMF.ESELHE 


934 


ELLKSELL?? 




PQMEESELHE 


3C2 


EI,LKSELL?F 




PQMEESELHE 


1C12 


ELLK3£LL?F 




PQUEESEIiHZ 


734 


ELL-KSELLP^ 




PQHEESBLHE 


859 


ELLKSELij!?? 




PQMEESELHE 


1312 









10f.O 

. I . 



1370 



1080 



DYTYD32ILK 
DYTYD3DILP 
DYTYDSSILK 
DYTYDSDILIt 
DVTVDSDIII' 
DYTYDS DIIF 
DVT YDS DTLF 




..I 



1100 

,.1 



1110 

..I .. 



112 



1130 
-I 



1140 

.1 




1150 
J. • 



1160 



♦ 1 • 



1170 

..I - 



LKRYCIERVfc' 
LKRYCIERVTf 
LKRiCIERVF 

LKRYCIERVF 
LKRYCT.F.RVF 
T.KRYCIERVF 
LKRYCIERVF 



.121.0 
■ I 




KELLECAFD? 
KEIiECAFCI 
KETif'ECRF"T 
KF.l.r.ECAFDT 
KElftBCftFO] 
KELtECAFDT. 




1190 



1200 







































3 







1196 
1114 
432 

11.92 

914 

1339 

1192 

92 




1230 



1240 



MLLK&ILJiHC 
ML1KAIL1HC 
MLIKAIILHC 
HLLiRAXlJ/fiC 
MLLKAILLHC 
tJLIiK?.ILLHC 
fcJLLK&lLLHC 
HriKATLTfHC 




r lYDAvi eIkltree vesfIf C 



1250 

i 

i 



I1YDAVTE KLTRREVEA?, FC 
LLYDP.VIE KLTKHL'VKAK E'C 
ILYDAVIE KLTRRBVBRK 
I LY DAVIE KLTRREVEAK FO 
ILYDAVIE KLTRREVEAK FC 
ILYDRVTE KLTRREVRAF FC 
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Chromosomal information 

MOL2 belongs to genomic DNA [Acc.NO.; AC025168 from GenbankNEW]. Within this 
GcnbankNcw entry was a note showing that the sequence was from Chromosome 15ql4. 
5 Therefore we assign the chromosomal locus of this invention as Chromosome 15ql4. 

Tissue expression 

MOL2 is expressed in at least the following tissues: brain and liver (derived from 
literature sources) and thyroid (derived from 20466828_EXT1). 

10 Based on information available on expression of S\V1SSPROT-ACC:P29089 TYPE-IB 

ANGIOTENSIN El RECEPTOR (AT1B) (AT3) - Ratlus norve&cm (Rat), the closest G-protein 
coupled receptor family member it is likely that MOL2 is expressed in cardiac tissue, renal 
tissue, and vascular tissue as angiotensin is expressed in these tissues.MOL2 has similarity to the 
murine GNC2 cIFK protein, a possible GNC2 elFK and other GNC2 elPKa and their functions 

15 as described in but not limited to the references below: In cukaryotic cells, protein synthesis is 
regulated in response to various environmental stresses by phosphorylatmg the alpha subunit of 
the eukaryotic initiation factor 2 (eIF2aIpha) (Berlanga, etal., Bur J Biochem; 265(2):754-62; 
Oct, 1 999). Three different eIF2alpha kinases have been identified in mammalian cells, the heme- 
Tegulated inhibitor (HR.I), the interferon-inducible RNA-depcndent kinase (PKR) and the 

20 endoplasmic reticulum-resident kinase (PERK). A fourth eIF2alpha kinase, termed GCN2, was 
previously characterized from Saccharomyees cerevisiae, Drosophila melanogaster and 
Neurospora crassa. Berlanga et at 1999 describe the cloning of a mouse GCN2 cDNA 
(MGCN2), which represents the first mammalian GCN2 homolog. MGCN2 has a conserved 
motif, N-terminal to the kinase subdomain V, and a large insert of 139 amino acids located 

25 between subdomains IV and V that are characteristic of the known eIF2alpha kinases. 

Furthermore, MGCN2 contains a class II aniinoacyl-tRNA synthetase domain and a degenerate 
kinase segment downstream and upstream of the eIF2alpha kinase domain, respectively, and 
both are singular features of GCN2 protein kinases. MGCN2 mRNA is expressed as a single 
message of approximately 5.5 kb in a wide range of different tissues, with the highest levels in 

30 the liver and the brain. Specific polyclonal anti-(MGCN2) immunoprecipitated an eIF2alpha 

kinase activity and recognized a 190 kDa phosphoprotein in Western blots from either mouse 

liver or MGCN2-transfected 293 cell extracts. Interestingly, serum starvation increased 

eIF2alpha phosphorylation in MGCN2-transfected human 293T cells. This finding provides 

evidence that GCN2 is the unique eIF2alpha kinase present in all eukaryotes from yeast to 
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mammals and underscores the role of MGCN2 kinase in translational control and its potential 
physiological significance. 

A family of protein kinases regulates translation in response to different cellular stresses 
byphosphorylation of the alpha subunit of eukaryotic initiation factor-2 (eIF-2alpha). In yeast, an 
5 eIF-2alpba kinase, GCN2, functions in translational control in response to amino acid starvation. 
It is thought that uncharged tRNA that accumulates during amino acid limitation binds to 
sequences in GCN2 homologous to histidyl-tRNA synthetase (HisRS) enzymes, leading to 
enhanced kinase catalytic activity. Given that starvation for amino acids also stimulates 
phosphorylation of eIF-2alpha in mammalian cells, we searched for and identified a GCN2 

10 homologuc in mice. Sood ct.al., 2000 cloned three different cDNAs encoding mouse GCN2 
isoforms, derived from a single gene, that vary in their amdno-terminal sequences. like their 
yeast counterpart, the mouse GCN2 isofonns contain HisRS-related sequences juxtaposed to the 
kinase catalytic domain. While GCN2 mRNA was found in all mouse tissues examined, the 
isoforms appear to be differentially expressed. Mouse GCN2 expressed m yeast was found to 

15 inhibit growth hy hypeiphosphorylatiori of eIF-2atpha, requiring both the kinase catalytic 

domain and the HisRS-related sequences, Additionally, lysates prepared from yeast expressing 
mGCN2 were found to phosphorylate recombinant eIF-2alpha substrate. Mouse GCN2 activity 
in both the in vivo and in vitro assays required the presence of sexine-51, the known regulatory 
phosphorylation site in eIF-2alpha. Together, those studios identify a new mammalian cIF- 

20 2alpha kinase, GCN2, tfiat can mediate translational control Phosphorylation of the alpha subunit 
of eukaryotic initiation factor 2 (eIF-2alpha) is a well-characterized mechanism regulating 
protein synthesis in response to environmental stresses (Yang et al., Mol Cell Biol; 20(8):2706- 
17; Apr, 2000) . In the yeast Saccharomyces cexevisiae, starvation for amino acids induces 
phosphorylation of eIF-2alpha by Gcn2 protein kinase, leading to elevated translation of GCN4, 

25 a transcriptional activator of more than 50 genes. Uncharged tRNA that accumulates during 
amino acid limitation is proposed to activate Gcn2p by associating with Gcn2p sequences 
homologous to histidyl-tRNA synthetase (ffisRS) enzymes. Given that eIF-2alpha 
phosphorylation in mammals is induced in response to both carbohydrate and amino acid 
limitations, we addressed whether activation of Gcn2p in yeast is also controlled by different 

30 nutrient deprivations. It was found that starvation for glucose induces Gcn2p phosphorylation of 
eIF-2alpha and stimulates GCN4 translation. Induction of eIF-2alpha phosphorylation by Gcn2p 
during glucose limitation requires the function of the HisRS-related domain but is largely 
independent of the ribosorne binding sequences of Gcn2p. Furthermore, Gcn20p, a factor 
requited for Gcm2 protein kinase stimulation of GCN4 expression in response to amino acid 
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starvation, is not essential for GCN4 translationa! control in response to limitation for 
carbohydrates. These results indicate there are differences between the mechanisms regulating 
Gcn2p activity in response to amino acid and carbohydrate deficiency. Gcn2p induction of 
GCN4 translation during carbohydrate limitation enhances storage of amino acids in the 
5 vacuoles and facilitates enfry into exponential growth during a shift from low-glucose to high- 
glucose medium. Gcn2p function also contributes to maintenance of glycogen levels during 
prolonged glucose starvation, suggesting a linkage between amino acid control and glycogen 
metabolism. 

1 0 Uses of the Compositions of the In ven Hon 

The expression pattern, map location and protein similarity information for MOL2 

suggest that it may function like a member of the GCN2 elFaK family. Therefore, the nucleic 
acids and proteins of the invention are useful in potential therapeutic applications implicated, for 
example but not limited to, in various pathologies /disorders as described below: 



15 



Hjperfhyroidism 
Hypothyroidism 

Von Hippel-Lindau (VHL) syndrome 



Alzheimer's disease 



20 



Stroke 



Tuberous sclerosis 



30 



25 



Hypercalcemia 
Parkinson's disease 
Huntington's disease 
Cerebral palsy 
Epilepsy 

Lesch-Nyhan syndrome 
Multiple sclerosis 
Ataxia-telangiectasia 
Leukodystrophies 
Behavioral disorders 



Addiction 



Anxiety 



Pain 
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• Cirrhosis 

• Transplantation 

• and/or other pathologies/disorders. 

5 Potential Iherupeuiic uses for the invention^) are, for example buL not limited to, 

the following: (i) Protein therapeutic, (ii) small molecule drug target, (iii) antibody target 
(therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic and/or prognostic 
marker, (v) gsne therapy (gene delivery/gene ablation), (vi) research tools, and (vii) tissue 
regeneration in vitro and in vivo (regeneration for all these tissues and cell types composing these 

10 tissues and cell types derived from these tissues). 

The nucleic acids and proteins of the invention arc useful in potential therapeutic 
applications implicated in various diseases and disorders described below and/or other 
pathologies and disorders. For example, but not limited to, a cDNA encoding the GCN2 elFaK 
-like protein may be useful in gene therapy, and the GCN2 elFotK -like protein may be useful 

1 5 when administered to a subject in need thereot By way of noulimiting example, lie 

compositions of the present invention will have efficacy for treatment of patients suffering from 
Hyperthyroidism, Hypothyroidism, Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, 
Stroke. Tuberous sclerosis, ffypercalceimia, Parkinson's disease, Huntington's disease, Cerebral 
palsy, Epilepsy, Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia-telangiectasia, 

20 Leukodystrophies, Behavioral disorders, Addiction, Anxiety, Pain, Neuroprotection Cirrhosis, 
Transplantation and/or other pathologies/disorders. The novel nucleic acid encoding the GCN2 
elltaK-like protein, and the GCN2 elFoK -like protein of the invention, or fragments thereof, 
may further be useful in diagnostic applications, wherein the presence or amount of the nucleic 
acid or the protein are to be assessed. These materials are further useful in the generation of 

25 antibodies mat bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. 



MOL3 

An additional protein of the invention, referred to herein as MOO, is a human 
30 complement C3-Hke protein. The novel nucleic acid was identified by TblastN using CuraGen 
Corporation's sequence file for MOL probe or homolog, run against the Genomic Daily Files 
made available by GenBank. The nucleic acid was further predicted by the program GenScan™, 
including selection of exons. These were further modified by means of similarities using 
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BLAST searches. The sequences were then manually corrected for apparent inconsistencies, 
thereby obtaining the sequences encoding the full-length protein. The novel nucleic acid of 4894 
nucleotides (82254077.0.1, SEQ ID NO:5) encoding a novel olfactory receptor-like protein is 
shown in Table 3A. An open reading frame (ORJF) was identified beginning with an ATG 
5 initiation codon at nucleotides 1-3 and ending wifli a TGA codon at nucleotides 4837-4839, 
Putative untranslated regions downstream from the termination codon are underlined in Table 
3A, and ihe slarl and stop codons are in bold letters. 



Table 3A. MOL3 Nucleotide Sequence (SEQ ID NO:5) 

ATGT CCCCQZClCI CCC T TTCTCT GCCAT CTC CCT ATGTCTCTC CCAT CT CCCTAT GC CT C TG CC ATCi'CCAAGG 
TACATC'JT<S5T GACCCCCCGAGTT CT GAGGGT TGGCAGTCCCi 5AGEASC ATTCACAT TCAG GCC CAC TCAGAC T CC 
AGAC AGCCCC T CAC AAGGACCCTCAAGCTGAACCTCACAG TG T GGGACTTC CCCA1 GAGGA&GACA3TGTTGGGA 
AGGAGCCAGC T CAT TC TCTCACCAGGAAACAACj? TTATG GAC CAGGCACCTCTGACCCTTCCCC AGACCCTGA! G 
TACC TCCC AAA^CCAGGG CAGCMTATGTC^CATCCGGGCAAC TTGGGCUCCCRCOT CGGGCT CCTCATTCAJ G 
GAG A AGATGGT GCTGGTGGCTCTT C^TGCT GGCT ACJV^CT^T ATCC^ T 
CCC CT AGTTC ACT ACCGG GT GTTC AC T GTGAAXC ACAAQAT GCACCCTCTGACCACC ACAV^ C 
AAGAATCXTTGATGGSTCCQ&UaCTTCCAGAGTCC^ 

?GG GGCCCTC A£ AAGT CC CT G7GC C7 C AGT T T GG GGACCTGGACCJ\T CGAAGCC AGCTACC AAAGT ACACC CAAG 
CAGAAGT T CGAGGC TG CC TYT C? V TGT GAAG GAAT ATCTCCTC C CAT CM 1 ITTGAGGTCCAGC^GCTCCCAAAT AAG 
AC TTTC T I TT ACCTCAAGGAT GAGCCTCT G3G CGT TGACAG'C C AGGC T CGGT ATAT AT T/TAAC AAGCCAGTGGAC 
GGA<^TGCOTGCTCATCTTTGG<3GTGAAAT^ 

GAGGTGAC TGAAACAGGGGGTGAGAT GGT G CAAGCTGAGACC TCAGGGGTGAAGATC AT CCAGAGCCCATACAAC 
AT CAAG T T CACCAG CA.CACCCCACT ATTTCAAGC CAGSAATGCCCTTCCACTrTCGGG TCAGAGTC GT ACAAAGC 
AG T CCT AT TC AGATCA7 ATT CCAGTC T CACCTCTC AC ACCAC-GCCACTGCAGGC TTTT CCTT CACCTT AGCCCAG 
ATTCCACC TCAGGTCTTC AT CT CAAAT CCTGATG GGTCCCCAGCT TCC AGAGT CCTTGT CCACTCC CAA3ACCAA 
AAAGTG 1 ACACCT CAGCTGAGGGGC?T GGCCACTC TGACCATCAACAC AGATGC AAATC T GGAC AAGCTCCCCAT C 
GAG GT GAAAACTG AGGAATCTCT T CAGCC AGAGGAGC AGGCf TCAGCCAAGAT GACAGCTT GGCCT TAG IT GACT 
CAGG AT GG GT CAGGAAAC T T CCT ACACATCGAAG T AAAGAC^_TTGGGC ACAGA3GTTCGCAGCAGC ATC CAGCT G 
AGCCTCAACACAAGGCATCAGGACCCTAAAACCAAGGACAAGACTACTCACTT- 
G3TAAGGCC::GGCAC^T?GGGAGGCAGGTGGaSCAG^^ 

CTTCCC AGA3GAGCAAGCC AAGACCCTGAGTT GGTGGCTGAT 1 CCAT ATGGAT TCATG TGAAT GACAGATGCAT A 
G GGCT GAAA3TTG G CT TGAAGAATGATAGATT CT TCCAGTCTTTG GAGCCCAACAGCC AAGTC GAAC T GAAGGT G 
ACAGG T GAT 3CAGAAGCCAC AGTGGGGCT G GTGGCTGTGGACAAG GCTGTC TATGTCT TGAAC AGCAAACACAAG 
CT CAC T CAGAAGRAGGT AT GGAATGT GGT GGAG GAAC AT GACAT TGGC T GCAC AGGAG GAAG TGGGAA^C ACAG A 
TT TGCT GTGT TCAAGGATGCT &GATTGGACCTGAAAATCAGCACAGGAAT GGATAGCGGC C ACCAGCA A^STCAC 
AG CT GCCAGGAGGC T GAGGTGGGAGAATC ACTTGAAC C C GGGAGGCAGAGGT T GCGGT CAG CCGAG AT r G CGCC A 
CT GCACTCCAGCGT GAACAAGT T TAAGAC AGAGCT GGAGC A^YftADTGC T GTGASGCTGGG C T CCG 3 GAS AKJCC A 
GT GGGGCT GT CTT o T GAG C- AGAGGACCTGGCAT GTCCGCCATGGT CC AGCCT GT GTGG CT GCTTT C CT G GACT GC 
TG CTCACAC 2-TGC T CCCTCCAGCGGAl GAAGAAGAGGACTTCGAT G AC CT CT TCTTGG AT GACAT GCC T GTGCG G 
ACCTTGTTCCCCGAGAGTTGGCT CTGGAACAGC AT CT C CCATTAC CCC AT CTC T GT GAAGGT GCCAGAT TCCATC 
ACC^GTGGCAC^T^GTSGTGGTCAGCCn^ 

AC AGT T ATGAAATCGTT C T TT GT GGACCTT AAGT TGC C C TCCTC CGTG AT CAG GAATGAGC AGGTCCAG ATCCAA 
GCCAT GTTGTACAAT TT CAJ3G GATCGCCAGGCCAAGGT CCGAGTGGAGT T CCC CCACA AG G AGAC ACTGT GCAGT 
GCGTCAAAGCCAGGAGCACCATCCIl^CCAG GT AGTGGTCGT GCCC C CCACCT CCTC CAAGATAGTACACTTTGTG 
. C T TC7C CCT CTGGAGACAG GCAAAGTGGA.C GT GG AGGTCAAGGCT GlX^GGC T ACGGGGTCCAGG ACCAT GTG AAG 
: AAGAC ACTC TTGGT CGAAG GTTG T GGTT AT TCAGGTC AGACCCAGACA AAACTGGT G CCAAGAC AGGAGT^CTTG 
AACATG GT ACCCGACACGGAG 3 CGGAAGT GTT^ATCASTGTTCAAGGTG ACATC CT T GGTG AGACAATT GTSGGC 
AGCCTG ACACCCAGTGAG ATT C AGCAG CTGCTGCGGG T CCCCACG GGCTGCCCTGAGCAGACGCT GAGCTCCCTG 
AC GCCCGT CA'rCAT CCTGT^CGCT AT/WGGAT ACCACCG SCCAGTGGGGC AAGGTCGGGGTGGAGCACAGGGAC 
CAGGTGAT G AAC^AATATT GGCTAC ACTCAGAT GCTGACCC AC CGGAGT T CAGACGG C ACCT ACC AC ACC TCCAAG 
GGGAACCCAGGAAGCACTTGGCTC AC AAGCTAT GTGTT CC GC GTCTTTGC CCT GGC C TACT CT AT GAT GACGACC 
C AAGT GCT T AGCC T GTCCTCT CTC TGTGACAT GGCCAACT GGATC AT CAT CCACAGG CAGGCAG AG GAT GGGCAC 
TT CCT GGAG AAGG G CCCTGTC GTCATGACATCCATG T CCGAGGAG GAT GT AT CCCTCACAGCT CTTGT CCTAATA 
GC CXTTGAATGAGGGAAAGGAGTTGTGC AG AC AGA AGGT AG3ACCCAATTTGAT GGCCAGCATCG AGAAGGCCGGA 
GG AT T GCI TGAGCTCAG GAGGT TGAGGCT GCAGC GGAGCT ATCCCGT AGCCAT AGCC'l^CCT ATGC AC!' G GCCGAC 
AAAACCCACT GGCC AGT GGAT GAGCAGAAT CTGGGCT CCCTGTACACC AT TGAGGCC ACAG CCT ATGGGCTCATG 
CAGAAGCTGGAGCTGGGCCGGT AC A ATGAGAC AC ACS C CAT AQCCAAGTGGC T ACTA3AG AAGCAGGAGCTGGGA 
GG AGGCTT CAJGGTCCACC CAGC CAGGCAGGAGC AGTG GCCT TTCCCAC CC AC AGCGCTGGC CACAG GG CT CCCTC 
KTGGCC CT TGAAGCTCTG ACCCGCTTCXXCGAAGCTGT<X;CCTTCAAGGGCATCCAGGATC 
AGAGCCCC CAAGACAGCC CTG AAT Gr GAAT T GGT ACATTGAT CAC AGC AAT GCCT ACCAACAGC GGTCAGCAAAC 
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TTCCTT GCCXIAGG ACGACCTA5AG ATC AAAGCCAGTGGCZ^C^33AGAGG CACCATCTCGAT CCT G ACAATGT AT 
CACAAGT CCCC AGAGT CCCGG GAGGAC AACT GCAACCTGT AC CACCT G AAT GCG ACT CTCCAC AGTGCCCTAGAA 
GAAAATAAAAAGGRAGCTGAGACTTTTCGGCTCCGGATC^AA^ 

ACT ATCATGGAGG TCTCC CTGCTCAC G GGCTTCTACCCC AACCAGG AT GACCTC AAAC AGC T CACGAG I GAJG T G 
GAGAGGT ACGCCT TTC AGTACAAAACCAAGAC AAGTACCAGC GACAGC ACT STTGTCCT C 7. ACCTG&AAAAGCTC 
7CCCATGASAAGAACACGGT GC rGGGCT T T CGGGTTC ACAG^ATGCTGCAGGCG^GTT CC TGC AOGCCGCCCT G 
GCC AC CAT CTACGACTACT ACGAGCCTT C CCG GAGO T GC AGC AC'f 'FTC T AC AACCTG CCCACAG AGC AATCTTC C 
CTGAJG AAAGATCT GCC AC AAAGACATCT GCAGAT GT GCAGAG GGACAST GCC CATCCCT GCAGAAGCCC AGT GG C 
C AAT T G AGGCAGGA6G AGCTCC AGACAACAGCAT CTGAGGCAGGCGT^ 

TCTGT GGAGGTC T CTGC CT CC AACCCTT ACGT CT AT T ACAACACGCAGCTC GAAGACAT CAT TAAGAGTGGT AC G 
GACCC TGC CAAAC CCC T GGCC!ArGAAGAAATT C.G TC TCCCATG CCACTTGCCATGAC TCCC TGGGGTTGCAACAA 
C AGGAATC GTACCTCATCATGGGCCAGACG TC AGACCTG T GGAGJIAT CAAATC TGATT ACAGCT AT GT TCTGGGC 
AAGGAGAC 2TTC CTCATCCTIT GGCC AGCACATG GAG ATSCCAC C AAG AAACAATTCCCCC ACCA^\CTC-GACC AA 
T T TT T GGAAT ATAT GC SCACCC^CGGCTGC C AS 1 ! CCTGAGCCT £ T TCTGCTTT CAGGGASG TG T CAT CAGGG CAG 
CTCTGGGCCACGTGGGTTT 



The disclosed MOL3 polypeptide (SEQ ID NO:6) encoded by SEQ ID NO:5 is 1612 
amino acid residues, and is presented using the one-letter code in Table 3B. The MOL3 protein 
were analyzed for signal peptide prediction and cellular localization. Signals results predict that 
5 MOD is cleaved between position 20 and 21 of SEQ ID NO:6. Psort and Hydropathy profiles 
also predict that MOL3 contains a signal peptide and is likely to he localized at the endoplasmic 
reticulum (certainty of 0.5500). 



Table 3B. Encoded MOL3 protein sequence (SEQ ID NO:6). 

MS PSIiPFGA I LCLG IILPKPIjPS PRY 2 L VT P RVLRVGB PES I HI Q AHSDS RQP LTRTIiKVNLTVWD F PHRKTVIi 
ARSQlilLSPGin^MDQAPVTVPESljMYLPKPGQQ^Vl 1RATWAPT5GSS FKEKM7LVALHAGYIF I QTEKT I YT 
P3PLVHYRVFTVNHK*DPVTRTFTTiDrXNPD3SPA 

T PKQKFEAAFD VKE YVL fc»S If'fclVQLYflil KT F* YJjKDEALUVU I QARY I FNKPVDGHAIi VI PGVKLDSCRIP IQSS 
LQ3VEVEETGGEMVQAETSGVKI IQS P YHI XFTRTPQ YFKPGMPFEFRVRVVQ S S P I Q I IFQSHLSHQATAG7S 
PTIiP QX PP Q VF I SNPDGS P ASRVL VHSQDQXVYrfiAE GLATI/T I HTDANLB KL P I EVKTEES LCPE B QAS AKMT 
AWPYLTCP3SGMTLH TEVKTLGTEVGSS I QISI^JTRHQE PKTXDKI THFTHiWREGKARQIjGRQVAQ VGVP 3 P 
RI LAFYlxLF RGASQD PEL VADS I WID VND RC E GLKVGLKND RFPQ SLE PNSQVELKVTODAEATVGDVAVDKAV 
YVLMSXHKIjT QKKVWKWEEHD I GCTGGSG3CDRFAW KCAGliDLKI STGMESGKQQSHSCQEAEVGB SliEPGRQ 
KZrRSAB I APLHS SVUKFKTELE C KCCEAGLRE S PVGL £ CE H RTWHV3H3PACVAAFLDC CSHIiLPP ADEBEDFD 
DZjPIODMPVRTLFPESWjWNSI SHYP I S VXVPD5 ITT W QFVWS LKAG3GGL CVSDP FE LTVMKS F FVDL KLP S 
SyikNHQVCI^MLYNFRDRQAKVRVBPPEKET^ 
KAVGWGVC3EVKK31*LVEG^GnfSGQTQ^ 

V? TGCPEQTL SSLT PVI I LS RYLDTTGO^^GKVGVEIIRDC VMKNI GYTQMLTHRS SDGTYHT9KQKTG9TWLTSY 
VFRVFALAYS MMTTQVL S LS SLCDMANWT I IDRQAEDGHFIjEKGPVVMrSI4SEBDVSLTAIiVL lALKEGKEIiCR 
OKVOPKHL^SIEI^G^LEIJIRLRI^^ 

ETHAI AKWLLE KQ2LGGGFRSTQ PGRSSRL SHPQRWP QGSLXALE ALT RFREAVPFK5 1 QDLEVQ I ItAPKTALN 
VNWY I DHSNAYQGK.S AKFbAQDDIiE J KASGWGRGTIS ILTNYHKSFESREmCNLYHIJ^TLPSAI3EMa0G<33 
T TFJjRMETRFQNMSEATMTI MB VS LLTGF YPtf QDDLKQLTS DVE RYAF Q YKT KTSTSD 5TWLYLE KLSIIE KST 
VLGFSVHRMLQAE FLQAALVTI YD YYEP 2 RRCSTF YNLPTE Q S S LRKI C3KE I CRCAEG QCPSliQKP SGQLROE 
ELQTTACEAGVDFVYKTKLES VEVSASH"P X V X WVQ&SD 1 1 KSQTEPAKPTjAKKKFVSHATCHDiSLGLQEQESY 
L I KGQT SDLWRI KSD YSY VLGKETPL ILItf PAD GOASKKELPJDQLBE 7LBYMRTHGC Q£ 



The Ml amino acid sequence of the protein of the invention was found to have 257 of 
10 734 amino acids (35 %) identical and 403 of 734 (54%) homolog to a Cavia porcellus (guinea 
pig) complement C3 precursor (contains: C3A anaphylatoxin) (ACC:P12387; 1666 aa), and 255 
of 717 amino acid residues (35 %) identical to, and 401 of 717 residues (55 %) similar to, the 
1663 amino acid residue complement C3 precursor (contains: C3A anaphylatoxin) from Homo 
sapiens (human) (ACCP01024). 
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The disclosed MOL3 protein (SEQ ID NO; 6) also has good identity with a number of 
complement oompontent proteins, as shown in Table 3C. 



Table 3C. BLAST results for MOL3 


Gene Index/ 
Identifier 


Protein/ 
Organism 


Length 


Identity 
W 


Positives 

«*> 


Expect 


gi|4557395lref |N? 
_000055.ll 


compl axent 
component 3 
precursor [Homo 
sapiens] 


1GC3 


633/1750 
(35%) 


939/1750 
(53%) 


0.0 


gi|11869931|gblAA 
G40565.ljAF154.933 
1 (AF154933) 


compliant 
component C3 
!Sus szroZa] 


1661 


617/1733 
(35%) 


935/1733 
(53%) 


0.0 


309122 lgb|AAC4 
2013.11 (K02782) 


preprocompleraen 
t component C3 
[mus rnusculus] 


1€€5 


606/1734 
(34%) 


915/1734 
(51%) 


CO 



5 This information is presented graphically in the multiple sequence alignment given in 

Table 3D (with MOL3 being shown on line 1) as a ClustalW analysis comparing MOL3 with 
related protein sequences. 



Table 3D. Information for the ClustalW proteins: 

10 1) Novel MOL3 (SBQ ID N0:fc) 

2) gi | 4557385 |ref|NP C00055.1| complement component 3 precursor fHonio sapiens] 
<SE2 ID NO: 41) 

3) gi jllfc69931|gb|AAG40565.1|AF154932_l (AF154933) complement component C3 [Sus 
scrofa] (SEQ 13 NO: 42 ) 

15 4) gil309l22|gb)AAC42013.1| (3C02782) pre-oroccinolenient component C3 [Mas 

musculus} [SEQ ID NO: 43) 



MDL3 Prote 
NI_000055 , 
AF154333 
X027B2 



KOL3 PxotQ 
MP_OD005S . 
AF154»33 
K027B2 
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MCXL3 PXO to 

tfp_000055 

AF154933 




320 



33C 

I ,...|..,. | 

&ETSG VRIJ.QSPYKI KPTRT: 




MDL3 Prote 
BP 000055. 
AF154923 
X02782 



Prote 
BP 000055. 
AP154933 
KQ2782 



fcOL3 Prote 
BPJ500Q55 . 
&F134933 
K02782 



M02i3 Prote 
UP 000055. 
AF154933 
K02762 



MOI»3 Probe 
»p_C 00055. 
XF154933 
KD2782 



Prote 
HP_C0005S . 
W154933 
K02782 



6«XL3 Prote 
HP000055. 
M154933 
K02782 



Prote 
NPJJ00055 . 
AF1S4933 
U02702 



MOL3 Prote 
HP 000055. 
&F154933 
K02782 



37G 

....I.... I .. 



IvTSPV'ilHffctlTi- 
IVTSPYQIHpfcEPP; 
I vTS PYQI H F|?KTr 



3C0 



39C 




430 



.1. 



U0 
.1 



360 470 
I .... I ....I. ...I ... 






34C 









GCT 


gGS-SKDj 






SC? 


PGSGKOj 






GCT 


PG5GK.pl 


IBDWBKA3I 




GCT 


PGSGF^j 





910 



923 



930 



MQL3 ProtQ KPGAPSHgh/j}| VVSJl 1 ! 



940 

c B?— : 



950 



960 



910 
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MO.L3 shows significant homologies to human complement C3 proteins, as described in, 
but not limited to, the references below, 
5 It was found that transforming growth factor-betal acts as a potent inhibitor of 

complement C3 biosynthesis in human pancreatic cancer cell lines, Andoh etal. determined 
how transforming growth factor (TGF>bctal affects complement C3 secretion in the pancreatic 
cancer cell lines PANC-1 and BxPC-3. It is suggested that TGF-bctal may act as a potent 
inhibitor of C3 secretion in pancreatic cancer cell lines under inflammatory conditions. This 

10 action of TGF-betal did not correlate with NF-kappaB activation, but associated with the 
translocation of Fos protein into the nucleus. 

The cellular localization of complemen t C3 and C4 transcripts were analyzed in intestinal 
specimens from patients with Crohris disease. It has been suggested that the increase in C3 and 
C4 levels in jejunal perfusates of patients with Crohn's disease results from local intestinal 

15 synthesis of complement Laufer et al. suggest that there is local regulated production of 

complement in the intestine of patients with CD, and subsequent complement activation may 
contribute to the inflammatory process. 

The generation of complement C3 and expression of cell membrane complement 
inhibitory proteins by human bronchial epithelium cell line. They found that the interrelationship 

20 between human airway epithelium and complement proteins may affect airway defence, airway 
function, and airway epithelial integrity. Local generation of complement C3 and expression of 
cell membrane CIP by human bronchial epithelium and its modulation by proinflammatory 
cytokines might be an additional regulatory mechanism of local airway defence and may affect 
airway function and epithelial integrity in health and disease. 

25 Jnnssen et ol, Am J Kidney Dis 2000 Jan;35(l):21-8 suggested the activation of the acute 

phase response and complement C3 in patients with IgA nephropathy. The authors have shown 
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ystemic complement activation in patients with immunoglobulin A (IgA) nephropathy and 
reported that plasma levels of actC3 can indicate disease activity and renal outcome. 

Therapeutic applications 

5 

The expression pattern, and protein similarity information forMOL3 may function as a 
human complement C3-likc protein. Therefore, the nucleic acid and protein of the invention are 
useful in potential therapeutic applications implicated, for example but not limited to, cancer, 
lung diseases, including asthma, immundeficiencies, inflammation, Crohn's disease, neurological 
10 disorders, nephropathy, and other diseases and disorders. The homology to antigenic secreted 
and membrane proteins suggests that antibodies directed against the novel genes may be useful 
in treatment and prevention of cancer, lung diseases, including asthma, immundeficiencies, 
inflammation, Crohn's disease, neurological disorders, nephropathy, and other diseases and 
disorders. 

1 5 Potential therapeutic uses fox the inventions) are, for example but not limited to, the 

following: (i) Protein therapeutic, (ii) small molecule drug target, (iii) antibody target 
(therapeutic, diagnostic, drug targeting'cytotoxic antibod}'), (iv) diagnostic and/or prognostic 
markeT, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) tissue 
regeneration in vitro and in vivo (regeneration for all these tissues and cell types composing these 

20 tissues and cell types derived from these tissues. 

The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in cancer, lung diseases, including asthma, immundeficiencies, 
inflammation, Crohn's disease, neurological disorders, nephropathy, and other diseases and 
disorders. For example, but not limited to, a cDNA encoding the human complement C3-like 

25 protein may be useful in gene therapy, and the human complement C3-like protein may be useful 
when administered to a subject in need thereof. By way of non-limiting example, the 
compositions of the present invention will have efficacy for treatment of patients suffering from, 
for example, but not limited to, cancer, lung diseases, including asthma, imrnnndeficiencies, 
inflammation, Crohn's disease, neurological disorders, uephropathy, and other diseases and 

30 disorders. The novel nucleic acid encoding die human complement C3-like protein, and the 

human complement C3-like protein of the invention, or fragments thereof, may further be useful 
in diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed These materials are further useful in the generation of antibodies that bind 
immunospccifically to the novel substances of the invention for use in therapeutic or diagnostic 

35 methods. 
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MOL4 

The disclosed Wnt 8-like protein, MOL4 (also referred to herein as AC004826), is 
encoded by a nucleic acid, 1064 nucleotides long (SEQ ID NO:7). An open reading frame was 
5 identified beginning with an ATG initiation codon at nucleotides 4-6 and ending with a TGA 
codon at nucleotides 1057-1059. Putative untranslated regions upstream from the initiation 
codon and downstream from the termination codon are underlined in Table 4 A, and the start and 
stop codons are in hold letters. The encoded protein having 351 amino acid residues is presented 
using the one-letter code in Table 4B (SEQ ID NO : 8). 

10 



Table 4A. MOL4 Nucleotide Sequence (SEQ ID NO:7). 

GCGATGGGSAACCT GTT T A?G CTCTGGGCAGCT C ?GGG CATATGCTGTGCTGCA?T{!!AGfrGCC , r C TGCCT GGT CA 
GTGA ACAAT TTCCT GAT AACAGGTCCCAAGGC CT ATCT GACCTACACGACTAGTGTGGCCTT GGGTGCC C AGAGT 
SGCAT CGAGGAGT3CAAG TTCCAGT ITG ^TCGCGAACGCTG GAA2 IGCCCTGAAAATG CTCT TCAGCT CT CCACC 
CAGAACAGG CTGAGAAG T GCTACCAG AGAGACTT CCTTCATACAT SCT AT CAGCTCTGCTGGACT CAT GTAC ATC 
ATCACCAAGAACT STAG GATGG GtTGACTrCGAAAACTGTGGCTG T GAT G GGT CAAACAATGGAAAAAT AGGAGGC 
CATGGCfGGATCTGGGGMGClGCAGCGAOATO^ 

T TGGAGAAG GSGAAGGATGCC AGAGCCC.T GATGAAT CTT CAC AAC AACAGGGCCGGCASVd GGCAGT GAGAG C C 
ATOTGAAAAGGACATGCA&ATGTCATGSCATC^ 

GAM N 1'C CG*3GAGATGGG AGACT ACCTJ^^CCCCAAC IATGAC CACGCGCT GAAAAT T GAAATGGATAAGCGG C AG 
CTGAGAGCT GGGAACAGC GCCG AGG GCCACCGGGTGCCCGC T G A53CCT TCCT T CCTA3CGC AGAGGCGGAACT G 
AlCTrTITAGAGGPJ\TCJlCCAGAT ( T ACT Z TACCTG 2 AATT CCAGCCTGGGCA7 CT AT GGCACAG?\GGGTCGTGAG 
TCXX^TAC/iGAAC AGCC^.CAACACAT CCA3 GTGC^GCGACGTA&-TGTC-GGCC^CTGT^ ACTGACTG 1 : GGGCT-S 
C AGGTGG AAGAGAGGftP A AC T G AGGTCAIA AGCAGC T GT AACTGCAAAl TCCAGT GG1 GCTGT ACGGTC AAGTGT 
G ACCAGTGT AGGCAIGT GGT GAGCAAGT AT TACTGC GCAC t CT CCCCAGGCAGT GC CCAGTC CCT GGGT AAGGG " 
AGTGCCTCATAATA 



The disclosed nucleic acid MOT4 sequence has 881 of 1050 bases (83%) identical to a 
Mus muscuhis Wnt 8 mRNA (GENBANlC-ID: MMWNT8DPT|acc:Z68889) and 637 of 955 
bases (66%) identical to a Homo sapiens Wnt 8 mRNA (GENBANK-ID: 
15 HSWNT8|acc:Y11094). 

The MOL4 polypeptide (SEQ ID NO:8) encoded by SDQ ID NO:7 is presented using the 
one-letter amino acid code in Tabic 4B. The Psort profile for MOL4 predicts that this sequence 
has a signal peptide and is likely to be localized outside the cell with a certainty of 0.7700. The 
most likely cleavage site for a MOL4 peptide is between amino acids 24 and 25 based on the 
20 SignalP result 



Table 4B. MOL4 protein sequence (SEQ ED NO:8) 

RLRS ATRE T S F I HAI S S AGVM Y 1 2T KI'JCSMGDFEN CGCDGSMMGBCZGGHGV? IWGGCS DN VE FGE RI S KLFV DSL3 K 
GKDARALHNLHNN RA2RLAVRATMKRTCKCIIGI EG SCS 1 QTCWL QLAE FREMGD YLKAKY DQALK I EMEKRQ1,B AG 
NSAE GHWVPAEAFJi PS ASAR LI FLE ESP ^YCTCNS SZC1 YGTEG RECLQNS Rffl ERR SCGRLCTB CGLQ VEE R 
KTE VIS SCNCK F QWC 2 TVKC EQG RH WS KYYCARS PGS AQS LGKGSA 
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The full amino acid sequence of the disclosed MOL4 polypeptide has 282 of 349 amino 
acid residues (80%) identical to, and 306 of 349 residues (87%) positive with, the 354 amino 
acid residue WNT-8D protein ftomMus musculus (ptnr:SPTREMBL-ACC: Q64527), (E = 
1.5xl0 is ) 

BLASTP (Non-Redundant Composite database) analysis of the best hits for alignments 
with MOL4 are listed in Table 4C. 



Table 4C. BLASTP mutts for MOL4 


Gene index/ 
Identifier 


Protein'/ Organism 


Length 
(aa) 


identity 
<%> 


Positives 


Expect 


gi'11693046| 
gb AAG3B662. 
11 

(AY0C9402) 


WWTRri precursor [Hoir.o 
sapiens; 


355 


335/348 
(962) 


336/348 
(96%) 


0.0 


gi 6678169 |r 
ef 'riP 033316 
.1 


stimulated by retincic 
acid gene 11 [Mus 
musculus I 


354 


271/349 
(77%) 


29E/343 

;82%; 


1.0«- 
148 


gi !C4264-|pi 
rl S18771 


deve lopment a 1 
regulator Xvnt-6 - 
African clawed frog 


387 


246/335 
(73%) 


285/335 
(84%) 


1.0s- 
136 


gi 17223441a 
p|P51030|WN8 
C CHICK 


WNT-8C PR0T3IN 
PRECURSOR (CWNI-8) ! 


357 


242/337 
(715) 


283/337 
(333) 


l*0e- 
134 



1 0 This information is presented graphically in the multiple sequence alignment given in 

Table 4D (with MOL4 being shown on line 1) as a ClustalW analysis comparing MOM with 
related sequences. 

Table 4D Information for the ClustalW proteins: 



15 



20 



1) MOL4 (3BQ ID NO: 8) 

2) gi.U693046|gb|AA<S38G62.1| (AY009402) WNT8d precursor [Homo sapiens] (SEQ 
ID N0:44) 

3) gi.6G731G9|ref |NP_C33316. 1 | stimulated by retinoic acid ger.e 11 [Mus 
nusculus] (SEQ IE NO:45> 

4) gi ;i04264lpirj IS187V1 developmental regulator Xwnt-0 - African clawed frog 
(SEC ID NO: 46) 

5) gi (172 2844 fsp|P510 30 1 NN8C_CHICK WHT-8C PROTEIN PRECURSOR (CWNT-8) (SEO IP 
N0:47> 



25 



30 



1C 

.1 . 



2C 



30 



MDL4 Prate 
AY009402 
Nff_033316. 
8187 71 KyHTfn»F t 
P51030 



40 
.1 



50 



73 



35 MDl*4 prote 
W 0094 02 
WP 033316. 



BC 



93 



100 
.1 



110 



35 



60 




120 

.1 
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10 



15 



20 



25 



30 



35 



40 



S18771 
P51030 



M0I4 Prote 
AY003402 
NP 033316. 
818771 
P5103O 





■ODQ 






TEG 






TF.G 






TEG 


-.ECLC: 




TEG 


F.ECLO 




TEG 


F:ECLCj 



M0L4 Pro to 
AY00MQ2 
NP_D33ai5. 
S1B77X 
P51030 



373 38D 

i I ....l....f ....|.. 

MOM Prote 351 

AY009402 Z 355 

NP03331S. 354 

Si 8771 ATQTOISPTST HAS5FVCLQI> PFLNLW? 3B7 

P5103O 357 




45 WNT genes encode intercellular signaling glycoproteins that play important roles in key 

processes of embryonic development such as mesoderm induction, specification of the 
embryonic axis, and patterning of the central nervous system, spinal cord, and limbs. The name 
WNT denotes the relationship of this family to the Drosophila segment polarity gene 'wingless/ 
and to its vertebrate ortholog Intl, a mouse protooncogene; see WNT1. It was noted that multiple 

50 WNT genes are known to exist in several species that have been investigated ranging from 

Drosophila to man. They have been classified into various groups and subgroups on the basis of 
high sequence homology and common expression patterns. Hie vertebrate WNT8 subfamily 
includes genes from Xenopus, zebrafish, and chicken; The first mammalian WNT8 honiolog, a 
human member of the Wnt8 family that they tcimed WNT8B was characterized on the basis of 

55 the very high sequence similarity (90-91 % identity) of the inferred protein to those encoded by 
the Xenopus and zebrafish Wnt8b genes- The human cDNA encodes a 295-amino acid 
polypeptide that contains a C2II2 zinc finger-like motif. A predominant 1.9-kb mRNA was 
detected in a variety of adult and fetal tissues. They used PGR typing of a human 
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monochromosonial hybrid cell panel to map the gene to chromosome 10, and fluorescence in situ 
hybridization for localization at 10q24. 

The full-length cDNA sequence and genomic organization of the human WNT8B gene 
was presented and reported studies of expression of the gene in human and mouse embryos. Hie 
5 WNT8B gene contains six exons separated by small introns. with the exception of intron I . The 
predictedproteinhas351 amino acids. The gene is expressed predominantly as a transcript of 
approximately 2.1 kb. The human and mouse expression patterns appeared to be identical and 
were restricted to the developing brain, with the great majority of expression being found in the 
developing forebrain. In the latter case, expression was confined to the germinative 

10 neuroepithetium of three sharply delimited regions: the dorsomedial wall of the telencephalic 
ventricles (which includes the developing hippocampus), a discrete region of the dorsal 
thalamus* and the mammillary and relTomaramilkry regions of the posterior hypothalamus. 
Expression in the developing hippocampus may suggest a role for WNT8B in patterning of this 
region, and subchromosomal localization of the human gene to 10q24 may suggest it as a 

15 candidate gene for partial epilepsy (EPT; OMIM-6005 12) in families in which die disease has 
been linked to markers in this region. 

WNT1 is a member of a family of cysteine-rich, glycosylated signaling proteins that 
mediate diverse developmental processes such as the control of cell proliferation, adhesion, cell 
polarity, and the establishment of cell fates. Wntl was identified as an oncogene activated by the 

20 insertion of mouse mammary tumor virus in virus-induced mammary adenocarcinomas. 

Although Wntl is not expressed in the normal mammary gland, expression of Wntl in transgenic 
mice causes mammary tumors. To identify downstream genes in the WNT signaling pathway 
that are relevant to the transformed cell pbenotype, A PCR-based cDNA subtraction strategy was 
used, suppression subtractive hybridization. It was reported that the identification of two genes, 

25 WISP1 and W1SP2, that are upregulated in the mouse mairrxnary epithelial cell line transformed 
by Wntl, but not by Wnt4, Together with a third related gene, WTSP3, Ihese proteins deline a 
subfamily of the connective tissue growth factor family. Two distinct systems demonstrated 
WISP induction to be associated with the expression of WNT1 . WISP1 genomic DNA was 
amplified in colon cancer cell lines and in human colon tumors and its RNA ovcrcxpressed in 

30 84% of the tumors examined compared with patient-matched normal mucosa. WISP3 also was 
overexpressed in 63% of colon tumors analyzed. In contrast, WISP2 showed reduced RNA 
expression in 79% of the tumors. These results suggested that WISP genes may be downstream 
of WNT1 signaling and that aberrant levels of WISP expression in colon cancer may play a role 
in colon tumorigenesis. 
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It was found that Hie WISP1 cDNA encodes a 367-amino acid protein. Mouse and human 
WISP1 proteins are 84% identical; both have hydrophobic N-terminal signal sequences, 38 
conserved cysteine residues, and 4 potential N-linked glycosylation sites. Alignment of the three 
human WISP proteins showed that WISP1 and WISP3 are most similar (42%), whereas WTSP2 
5 had 37% identity witii WISP1 and 32% identity with WISP3. 

Uses of the Compositions of the Invention 

The above defined information for this invention suggests that M0T4 may function as a 
member of the "Wnt 8 family". Therefore, the novel nucleic acids and proteins identified here 

10 may be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 
invention include, but are not limited to: protein therapeutic, small molecule drug target, 
antibody target (therapeutic, diagnostic, drug targetingfcytotoxic antibody), diagnostic and/or 
prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration 

15 in vivo and in vitro of all tissues and cell types composing (but not limited to) those defined here. 
The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in neurodegenerative disorders, epilepsy, cancers including but not 
limited to brain tumor, colon cancer and breast cancer, developmental disorders, neural tube 
defects, and/or other pathologies and disorders. For example, a cDNA encoding the Wnt 8-Iike 

20 protein may be useful in gene therapy, and the Wnt 8-like protein may be useful when 

administered to a subject in need thereof. By way of nonlimiting example, the compositions of 
the present invention will have efficacy for treatment of patients suffering from 
neurodegenerative disorders, epilepsy, cancers including but not limited to brain tumor, colon 
cancer and breast cancer, developmental disorders, and neural lube defects,. The novel nucleic 

25 acid encoding Wnt 8-iike protein, and the Wnt 8-like protein of the invention, or fragments 

thereof, may further be usefitl in diagnostic applications, wherein the presence or amount of the 
nucleic acid ot the protein are to be assessed. These materials are further useful in the generation 
of antibodies that bind iramunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. 

30 

MOTS 

The disclosed novel Beta Thymosin-like MOL5 nucleic acid of 215 nucleotides (also 
referred to as AC025535) is shown in Table 5A. An ORF begins with an ATG initiation codon at 
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nucleotides 4-7 and ends with a TGA codon at nucleotides 211-213. A putative untranslated 
region upstream from tbc initiation codon and downstream from the termination codon is 
underlined in Table 5 A, and the start and stop codons are in bold letters. 



Table 5A. MOL5 Nucleotide Sequence (SEQ ID NO:9) 

AGTA TGGT C TCAGCCCAGCGrTTCA-CXaAGT CTT C&AG CCrTCAGGCTTTCTTTAM^AAGATGAGT QATftACCC 



The MOL protein encoded by SEQ ID NO: 9 has 69 amino acid residues and is presented 
using the one-letter code in Table 5B. The Psort profile for MOL5 predicts that this sequence 
has a signal peptide and is likely to be localized at the mitochondrial intermcmbrane space with a 
certainty of O.88U0. Using the SIGNALP analysis, the protein of the invention does not appear to 
1 0 contain a predictable signal peptide. 



Table 5B. Encoded MOL5 protein sequence (SEQ ID NO:10) 

MVSAQRFT3L QAFRLgU KM 3DM"PKL3 FTVKFDR3 KLKKTKlTGB KNRT^ 3 31EHT IQQE KyOVQTS YNGGWA 



The disclosed nucleic add sequence for MOL5 hasl67 of 191 bases (87%) identical to a 
Homo sapiens Beta Thymosin mRNA (GENBANK-ID: D82345|acc;D82345) (E= 5.1c" 26 ). 
15 The foil MOL5 amino acid sequence has 37 of 45 amino acid residues (82%) identical to, 

and 38 of 45 residues (84%) positive with, the 45 amino acid residue Thymosin beta protein 
from Homo sapiens (ptnr: PIR-H* JC5274) (E= 1 .2c" 1 

MOL5 also has homology to other proteins as shown m BLAST alignment results in 
Table 5C\ 



Table 5C. BLAST results for MOL5 


Gene Index/ 
Identifier 


Protein/ Organism 


Length \ Identity 
■aai ■ (%) 


Positives 
<%) 


Expect 


ref |NP_068832.1| 


thymes in, beta, 
iden-ified in 
r.eurcJbl ast cuaa 
cells {Homo 
sapiens] 


45 


37/45 
(829) 


38/43 
(345) 


Se-06 


pirl 1152084 


thyaicaiu bet a- 4 
precursor - rat 
(fragment) 


56 


27/39 
(€9%) 


33/39 
(84%) 


2e-04 


sp 1 ?2G0 6S | TYB4_M0US 
E 


THYMOSIN BETA- 4 


so 


27/39 
(69*) 


33/39 
(84%) 


3e-04 


gb|AAA3S746.1| 
(M92383) 


thyme sin bet a- 10 
[Home sspiens] 


49 


24/40 
(605) 


32/40 
(80%) 


0.002 


gb (AAB37101.il 
(U25664) 


thy.ncs.in beta- 
like protein 
[Rattus 
norvegicus] 


45 


31/39 
(79%) 


34/39 
(86%) 


0.C02 
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This information is presented graphically in the multiple sequence alignment given in 
Table 5D (with MOL5 being shown on line 1) as a ChistalW analysis comparing MOL5 with 
related protein sequences, 

5 



Table 5D Information for the CtustalW proteins: 



1) MOl£ (SEa ID RO:10> 

2) ref|HP 058832.11 thymosin, beta, identified in neuroblast cos cells [Homo 
10 sapiens] (3EQ ID NO: 48) 

3) pir||I52084 thymosin beta-4 precursor - rat (Ixagitient; (3EQ id NO:49) 
1) Sp | P2 0 0 65 1 'r¥B4_MOC$K THYMOS ih BETA- 4 ( SEC ID NO: 50) 

5) gblAAA3G74S.ll (MS2363; thymosin beta-10 [Homo sapiens] (SE2 ID KO:51} 

6) gb IAAB37101.il (U2S664; thymosin beta-like protein [Rattus norvsgicus] (SEQ 
15 ID 130:52) 




70 

,...|.... ! 

MOL5 Prote VQTSraGGBA 6S 

: NP_068832, VQJT5 45 

.' 152084 GBSI 56 

i P20065 qBflf 50 

M923B3 BIS 45 

U25fi84 B$RS 45 



Thymosin-beta-4 induces the expression of terminal deoxynusleotidyl transferase activity 
20 in vivo and m vitro, inhibits tlie migration of macrophages, and stimulates the secretion of 
hypothalamic luteinizing hormone-releasing hormone. It was noted that the protein was 
originally isolated from a partially purified extract of calf thymus, thymosin fraction 5, which 
induced differentiation of T cells and was partially effective in some immunocompromised 
animals. Further studies demonstrated that the molecule is ubiquitous; it had been found in all 
25 tissues and cell lines analyzed It is found in highest concentrations in spleen, thymus, lung, and 
peritoneal macrophages. It was stated that thymosin-beta-4 is an actin monomer sequestering 
protein that may have a critical role in modulating the dynamics of actin polymerization and 
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depolymerization in nonmuscle cells. Its regulatory role is consistent with the many examples of 
transcriptional regulation of T-beta-4 and of tissue-specific expression. Lymphocytes have a 
unique T-beta-4 transcript relative to tiie ubiquitous transcript found in many other tissues and 
cells. It was stated that rat thymosin-beta-4 is synthesized as a 44-amino acid propeptide which is 
5 processed into a 43-amino acid peptide by removal of the first methionyl residue. The molecule 
does not have a signal peptide. Human thymosin-beta-4 has a high degree of homology to rat 
thymosin-beta-4; the coding regions differ by only 9 nucleotides, and these are all silent base 
changes. 

By differential screening of a cDNA library prepared from leukocytes of an acute 
1 0 lymphocytic leukemia patient, a cDNA encoding thymosin-beta-4 was isolated. Using Northern 
blot analysis, the expression of the 830-nucleotide thymosin-beta-4 mRNA in various primary 
myeloid and lymphoid malignant cell lines and in hemopoietic cell lines was studied. It was 
stated that the pattern of lhymosin-beta-4 gene expression suggests that it may he involved in an 
early phase of the host defense mechanism, 

1 5 A cDNA clone for the human interferon-inducible gene 6-26 was isolated and showed 

that its sequence was identical to that tor the human ihymosin-beta-4 gene. By use of a panel of 
human rodent somatic cell hybrids, it was shown that the 6-26 cDNA recognized seven genes, 
members of a niultigene family, present on chromosomes 1, 2, 4, 9, 1 1, 20, and X. These genes 
are symbolized T MST A , TMSL2, etc., respectively. T.i et al, (1 996) established that in the mouse 

20 there is a single TmsM gene and that the lymphoid-specific transcript is generated by extending 
the ubiquitous exon 1 with an alternate downstream splice site. By interspecific backcross 
mapping, they located the mouse gene, which they symbolized Ptmb4, to the distal region of the 
mouse X chromosome, linked to Btk and Gja6. Thus, the human gene could be predicted to 
reside on the X chromosome in the general region of Xq2 1 ,3-q22, where BTK is located By 

25 analysis of somatic cell hybrids, the fhymosin-beta-4, or TB4X, gene were mapped to the X 
chromosome. They noted that a homologous gene, TB4Y, is present on the Y chromosome. 

It was stated that prostate carcinoma is the most prevalent form of cancer in males and 
the secoud leading cause of cancer death among older males. The use of the serum prostate- 
specific antigen test permits early detection of human prostate cancer; however, early detection 
30 has not been accompanied by an improvement in determining which tumoTS may progress to the 
metastatic stage. The process of tumor metastasis is a multistage event involving local invasion 
and destruction of extracellular matrix; intravasation into blood vessels, lymphatics or other 
channels of transport, survival in the circulation; extravasation out of the vessels into the 
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secondary site; and growth in the new location. Common to many components of the metastatic 
process is the requirement for tumor cell motility. A well-characterized scries of cell lines that 
showed varying metastatic potential was developed from the Dunning rat prostate carcinoma. A 
direct correlation between cell motility and metastatic potential in the Dunning cell lines was 
5 shown. In studies comparing gene expression in poorly and highly motile metastatic call lines 
derived from Dunning rat prostate carcinoma using differential jnRNA display, Bao et al. (1 996) 
found a novel member of the thymosm-beta family of actm-binding molecules. The molecule, 
named thymosin-heta-1 5 by them, was found to deregulate motility in prostate cells directly. In 
addition, it was expressed in advanced human prostate cancer specimens, but not in normal 

1 0 human prostate or benign prostatic hyperplasia, suggesting its potential use as a new marker for 
prostate carcinoma progression. Bao et al. (1 996) found that mymosm-beta-15 levels correlated 
positively with the Gleason tumor grade. Coffey (1996) pointed out that the upregulation of 
thymosin-bcta-15 as a positive motility factor and the down regulation of the motility suppressor 
KAI1 (OMIM- 600623) provide the 'yin and yang r for metastasis; he speculated that these 

1 5 pathways may provide a new target for therapy. 

Angiogenesis is an essential step in the repair process that occurs after injury. In a study, 
the angiogenic thymic peptide thymosin beta4 (Tbeta4) enhanced wound healing in a rat foil 
thickness wound model was examined. Addition of Tbcta4 topically or intraperitoneally 
increased reepithelialization by 42% over saline controls at 4 d and by as much as 61% at 7 d 

20 post-wounding. Treated wounds also contracted at least 1 1% more man controls by day 7. 

Increased collagen deposition and angiogenesis were observed in the treated wounds. We also 
found that Tbeta4 stimulated keratmocytc migration in the Boydcn chamber assay. After 4-5 b, 
migration was stimulated 2-3-fold over migration with medium alone when as little as 10 pg of 
Tbela4 was added to the assay. These results suggest mat Tbeta4 is a potent wound healing 

25 factor with multiple activities mat may be useful in the clinic. 

Uses of the Compositions of the Invention 

The above defined information for this invention suggests thatMOL5 may function a3 a 
member of a "Beta Thymosin family 5 '. Therefore, the novel nucleic acids and proteins identified 
30 here may be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 
invention include, but are not limited to: protein therapeutic, small molecule drug target, 
antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or 
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prognostic marker, gene therapy (gene delivery/gene ablation}, research tools, tissue regeneration 
in vivo and in vitro of all tissues and cell types composing (but not limited to) those defined here. 

The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in cancer including but not limited to prostate cancer, immunological 
5 and autoimmune disorders (i. e., hyperdiyroidism), angiogenesis and wound healing, modulation 
of apoptosis, neurodegenerative and neuropsychiatry disorders, age-related disorders, and other 
pathological disorders involving spleen, thymus, king, and peritoneal macrophages and/or other 
pathologies and disorders. For example, a cDNA encoding the Beta Thymosin-like protein may 
be useful in gene therapy, and die Beta Thymosin-like protein may be useful when administered 

10 to a subject m need thereof. By way of nonlimiting example, the compositions of the present 
invention will have efficacy for treatment of patients suffering from cancer including but not 
limited to prostate cancer, immunological and autoimmune disorders {Le., hyperthyroidism), 
angiogenesis and wound healing, modulation of apoptosis, neurodegenerative and 
neuropsychiatric disorders, age-related disorders, and other pathological disorders involving 

15 spleen, thymus, lung, and peritoneal macrophages. The novel nucleic acid encoding Beta ITiymosm- 
likc protein, and the Reta Tbymosin-iike protein of the invention, or fragments thereof may 
further he useful in diagnostic applications, wherein the presence or amount of the nucleic acid 
or the protein are to be assessed. These materials are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 

20 therapeutic or diagnostic methods. 

MOL6 

MOL6a 

The disclosed novel Tr^sin-like MOL6a nucleic acid of 730 nucleotides (also referred to 
as GM_87760758_A) is shown in Table 6A. An open reading begins with an ATG initiation 
25 codon at nucleotides 8-10 and ends with a TG A codon at nucleotides 713-715. A putative 
untranslated region upstream from the initiation codon and downstream from the termination 
codon are underlined in Table 6A, and the start and stop codons are in bold letters. 



Table 6A. MOL6a Nucleotide Sequence (SEQ ID NO;ll) 

GATCACCATCAfiATATCjTCTTCTATTTGG 

AG AAG AC CCTGCTCC C TA T T TGGT GT ACCTC AAGTCT CACTT C AACCCCT GTGTG GG 2GTC CT CAT CAAAC CC AS 
CT GGGT GC7G SCCCCAGC T CACFG CT ATTT ACCAAAT CT GAAAGT GATGCT GGGAAAI TTCAAGA3 CAGAG? CAG 
AGACGGT ACT 3AACAGAC AATTAACCCCATTCAGAT CGT CCG CTACTGGAACT ACAGrCATAGCGCCCCACAGG A 
IX^CTC^'GClCftlCAASCT^^^ 

CAATGT CAGGCCAGGC AC TGTC?GTCTACTCTCAGGT T TGGACTGGAGCCAAGAAA AC AGTGGCCG ACACCCTG A 
CTT GCGGCAGAACCTG GASGCCCCCGTGATG'l'CT GAT CGAGAAIX^CAAAAAAC AGAACAAGGAAAAAGCCACAG 
GAATTCCTT AT GrCTGJWtT^GTGAAAGl 1 ATT CP.fiC^ CG AAITTTT TGGGGAGGTGGCCGTTGCTACTGT GATCTG 
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I C A7\AGACJUiGCTCC AGGG? ATCGAGGTGSGGCACTTCATGGG AGG GGJtCGTCG GCA?C TAC ACCAATGT TTACAA 
| ArflTGTATC CTGGAT !TGAGAftCAC TGCTA&GGACflflG TGflgACCC f ACTTCf CCC 



Hie disclosed nucleic acid sequence has 354 of 581 bases (60%) identical to a Mus 
musculus prcpro-Trypsiniiiogen mRNA (GENBANK-ID: MMTRYAR|acc^04574) (E value = 
9.9c- 24 ). 

5 The MOL6a protein encoded by SEQ ID NO: 1 1 has 235 amino acid residues, and is 

presented using the one-letter uode in Table 6B {SEQ ID NO: 12). The Psort profile for MODSa 
predicts (hat this sequence has a signal peptide and is likely to be localized on the outside with a 

certainty of 0.3700. The most likely cleavage site for a peptide is between amino acids 19 and 

20, ADS-SV based on the SignalP result 

10 



Table 6B. Encoded MOL6& protein sequence (SEQ ID NO:12). 

>£<YV PYI*GVLAGTFFFAT)SS VQKEDPAPYLVYl7KSH FN PCVGVlil WVIAPAHCiXEWLK^lIjGNrKSRVjlDGI 
EQTI K ? I QI VR YWNY SHS APQDDLML IKLAKE AKLN P KVQPLPLRT TN VR PGTVCIiL S GLDKSQtWSGKH PDLRQN 
LZAPVHSDRECQKTilCiGKSHRNSLCVKFVKVFS RI FGEVAVATVI CKDKLQGI E VGH FMGG DVGT YTNVYKYV3W I 
ENTAKDK 



The full amino acid sequence of MOL6a was found to have 79 of 208 amino acid 
residues (37%) identical to, and 1 1 8 of 208 residues (56%) positive with, the 248 amino acid 
residue TRYPSINOGEN I-Pl PRECURSOR (EC 3.4.21.4) protein from Gallus gallus (ptnr: 
15 SWISSNEW-ACC: Q90627) (E value = Lie- 33 ). 

MOL6 also has high homology to the proteins shown in the BLAST data in Table 6C. 



SNP anatysis of MOL6a is described in Example 2. 



Table 6C BLAST results for MOLtia 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
<*> 


Expect 


gi|2492B62|sp|QS062 
7 . TRY1 CHICK 


TRYPSIN I-Pl 
PRECURSOR 


248 


79/203 
(37%) 


118/200 
<S53) 


2e-31 


gi|2118087|pixl |S55 
067 


trypsin (jJC 
3.4.21.4) I 
precursor , 
pancreatic - 
chicken 


248 


78/233 
(373) 


117/208 
<55S> 


5e-31 


gi I 5678439 IreflNP 0 
33456, If 


trypsin 2 [Mus 
mus cuius I 


246 


77/215 
f35ft) 


115/215 
(52%) 


86-29 


gi| 1533123 Ipdbjlsvrf 

IB 


Chain B, Rat 
Anionic N:43h, 
BISlh Trypsin 
Complexed To A86h 
Ecotin 


223 


74/212 
(34%) 


116/212 
(53%) 


le-29 



MOLflb 

20 In the present invention, the target sequence identified previously, MOL6a, Accession 

Number GM_877C0758_A, was subjected to the exon linking process to confirm the sequence. 
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PCR primers were designed by starting at the most upstream sequence available, for Ihc forwajd 
primer, and at the most downstream sequence available for the reverse primer. In each case, the 
sequence was examined, walking inward from the respective termini toward the coding 
sequence, until a suitable sequence that is cither unique or highly selective was encountered, or ? 
5 in the case of the reverse primer, unlil the stop codon was reached Such suitable sequences 
were then employed as the forward and reverse primers in a PCR amplification based on library 
containing a wide range of cDNA species. The resulting amplicon was gel purified, cloned and 
sequenced to high redundancy to provide the sequence reported below, which is designated 
MOL6b, Accession Number GM_87760758_A_da 
10 The disclosed novel Trypsin-Kke MOL6b nucleic acid of 730 nucleotides (also referred 

to as GM_8776075S_A_da) is shown in Table 6D. An open reading frame begins with an ATG 
initiation codon at nucleotides 8-10 and ends with a TGA codon at nucleotides 713-715. A 
putative untranslated region upstream from the initiation codon and downstream from the 
termination codon are underlined in Table 6A, and the start and stop codons are in bold letters. 

15 



Table 6D. MOLGb Nucleotide Sequence (SEQ ID NO:13) 

CfoTC&CCA TOAAATATCTCTTCTAra 

AAGAAGACCCTGCTC CCTATTT3GTGTACCTCAAGT CTCAC TTCAAC C C CTGTG TGGGCGTC CTCATCAAAC CC 
AG CTOGGFGCTGGCC CCAGC TCACTG CT ATTTACCA A AT CT G&AAGTGATOCTGGGA AATT TC AAGAGCAGAGT 
CAGAGACGGTAC-rOAACAGACAA'il'AAC CCCATT CAGAT CG'i'CCGCTAC ^GGAACTACAGl CATAGCGC CCCAC 
AGCATGACCT C ATSCTCAT CAAGCTGGC TAAGCCTGCCATG CTCAAT CCC AAAGTCCAG CC CCTTCCC CTCGCC 
ACCACCAATGT CAGGCCAGGCAC IGT-^TGTCTACI CTCAGG rrTGGACTGGAGC CAAGAAAACAGTGG 2CGACA 
CCCTGACTTGCX^CAGAACCTWAG^ 
GCCAaAGG^a^CXn^TGTGTGAAATTTGTGAAAGT^ 

GTCAXC TG CAAAGACAAGCT CCAGGGAATCGAGOTCCGG CACTTCATG GGAGGGGACGT CGGCAT CTACACCAA 
TOTTTACftAATAgqTATCC^ 

The MOL6b protein encoded by SEQ ID NO: 13 has 235 amino acid residues, and is 
20 presented using the one-letter code in Table 6E (SEQ ID NO: 14). The Psort profile for MOL6a 
predicts that this sequence has a signal peptide and is likely to be localized on the outside with a 
certainty of 0.3700, The most likely cleavage site for a peptide is between amino acids 19 and 
20 based on the SignalP result. 



Table 6E. Encoded MOL6b protein sequence (SEQ ID NO:14). 

MKYVTYLGVL^^ 

EQ/T I H P 1 21 VRYW M YS H SAP QDDI^XjIXLAKP AULK PKVQFL PLAT'TK^/ltPGlVCLL SGL CW SQWJ S GRHPU-LRQN 
L BA PVMS DREC QKT RQGKS H RNSLCVK PVKVFS R I 7GF1 VAVAT V ICK DKLQ 3IF.VG H FMG GDVGT. YTNV YKYVS Wl 
2KTAKPK 
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The full amino acid sequence of MOL6b was found to have homology with several 
proteins including those disclosed in the BLASTP data in Table 6F. 



Table 6F. BLAST results for MOL6b 


Gene index/ 
Identifier 


Protein/ Organism Length 
iac) 


Identity 


Positives 
W 


Expect 


ACC:Q90627 


TRYPSIN I- PI 248 
PRECURSOR IZC 
3.4.21.4) - 
Gallus callus 
(Chicken) 


79/208 
(37%) 


110/208 
(56%) 


3,8e- 
34 


PIR-ID:S55CS7 


trypsin (EC 
3.4*21.4) 2 
precursor/ 
pancreatic - 
chicken 


248 


78/208 
(37%) 


117/208 
(55%) 


7.9e- 
34 


ACC:Q90628 


TRIPSIN I-P38 
PRECURSOR (EC 
3.4.21.4) - 
Gallus callus 
(Chicken) 


248 


78/2C8 
(37%) 


117/208 
(56*) 


l.Oe- 

33 


ACC:P07477 


TRYPSIN I 
PRECURSOR (EC 
3.4.21.4) 
(CATICNIC 
TRIPSINOGEN ) - 

Homo sapiens 
(Human) 


247 


76/212 
(35%) 


112/212 
(52%) 


1.3e- 
31 



5 

MOL6b also has high homology to the proteins shown in the BLASTX alignment data in 



Tabic 6G. 



tabic 6G. BLASTX results for MOLfib 






Smallest 








Sum 




Reading 


=Ugh 


Prcb 




Sequences producing High- scoring Segment Pairs: Frame 


Score 


*(H) 


N 


ptr.r:SWrSSPaOT-ACC:Q90627 TRYPSIN I-Pl PRECURSOR <EC 3. -1-2 


372 




.1 


ptnr:PIR-ID:S55067 trypsin (EC 3.4.21.4} I precursor, .+2 


369 


4.7e-33 


1 


ptr.r:SWrSSP^OT-ACC:Q90620 TRYPSIN I-P3G PRECURSOR (EC .+2 


360 




I 


pt;:r:SWISSPROT-ACC:P0714 6 TRYPSIN II, ANIONIC PRECURSO.+2 


350 


4 , 9a-31 


1 


ptnr2SWISSP:<O'l , -ACCiP07477 TRYPSIN I PRECURSOR (EC 3. 4.. +2 


348 


7 . 9e-31 


1 


pt-r:aPTREM2L-ACC:Q9R0T7 PANCREATIC TRYPSIN - Mus muse. -1-2 


348 


7.9S-31 


1 


ptr.r:SBISSPROT-ACC:Pl555-. TRYPSIN III PRECURSOR (EC 3.. +2 


347 


I.0a-30 


i 



This information is presented graphically in the multiple sequence alignment given in 
10 Table 6H (with MOL6a being shown on line 1 and MOL6b being shown on line 2) as a 
ClustalW analysis comparing MOL6 with related protein sequences. 

Table 6H Information for the ClustalW proteins: 
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1) M01»6a tSiiU ~0 NO: 12) 

2) MO L 6b (SEQ ZD HO: 14) 

3) gi | 2499862 |sp]Q90627 |TRYl_CRICK TRYPSIN J-Pl PRECURSOR {SZQ ID MO: 53) 

4) gi | 211808*? Ipir | | S5506V trypsin (EC 3.4.21.4) I precursor, pancreatic - 
chicken (STCQ in NC:54) 

5) gil 6678439 |.cef I NP_03345€.ll trypsin 2 [Mus mus cuius] (SEQ ID KO: 55) 

6) gi | 1633122 lpdb|lSLW]B Chain ti t Rat Anionic Hl43h, £151h Trypsin Complexed To 
A86h Ecotin (SEQ ID NO: 56) 





MOL6b also contained several single polynucleotide polymorphisms described in TaWe 

6L 



Table 6L SNP for MOL6b 


Position 


Nucleotide Change 


Number of Occurrences 


70 


OA 


2 


70 


OG 


6 


261 


T>C 


2 


406 


A>C 


2 
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573 


G>T 


2 


585 


OT 


2 


737 


AXi 


4 



Trypsin (EC 3.4.21.4 ), like elastese, is a member of the pancreatic family of serine 
proteases. The gene encoding trypsin- 1 (TRY1) m also referred to as serine protease-1 (PRSS1). 
MacDonaJd et ai. (1 982) reported nucleotide sequences of cDN As representing 2 pancreatic rat 
5 tiypsinogens.UsingaratcDNAprobe^IIoneyetal. (1984, 1984)foundthata3.8-kbDNA 
fragment containing human ttypsin-1 gene sequences coscgrcgatcd with chromosome 7, and 
assigned the gene further to 7q22-7qter by study of hybrids with a deletion of this segment. The 
trypsin gene is on mouse chromosome 6 (Honey et al., 1984). Caiboxypeptidase A and trypsin 
are a syntenic pair conserved in mouse and man Emi et al. (1986) isolated cDNA clones for two 
10 major human tripsinogen isozymes from a pancreatic cDNA library. The deduced amino acid 
sequences had 89% homology and the same number of amino acids (247), including a 1 5-amino 
acid signal peptide and an 8-amino acid activation peptide. Southern blot analysis of human 
genomic DMA with the cloned cDNA as a probe showed that the human trypsinogen genes 
constitute a family of more than ten, some of which may be pseudogenes or may be expressed in 
. 15 other stages of development 

Rowcn ct al. (1996) found that there are eight trypsinogen genes embedded in the beta T 
cell receptor locus or cluster of genes (TCRB; OMIM- 186930), which maps to 7q35. hi the 685- 
kb DNA segment that they sequenced they found five tandemly arrayed 10-kb locus-specific 
repeats (homology units) at the 3-prime end of the locus. These repeats exhibited 90 to 91% 

20 overall nucleotide similarity, and embedded within each is a trypsinogen gene. Alignment of 

pancreatic trypsinogen cDKAs with the gcrmlinc sequences showed that these trypsinogen genes 
contain five exons that span approximately 3.6 kb. Further analyses revealed 2 trypsinogen 
pseudogenes and one relic trypsinogen gene at the 5-prime end of the sequence, all in inverted 
transcriptional orientation. They denoted eight trypsinogen genes Tl through T8 from 5-prime to 

25 3-prime. Rowen et aL (1996) found that only two of three pancreatically expressed trypsinogen 
cDNAs correspond to trypsinogen genes in the TCRB locus; T4 was denoted trypsinogen 1 and 
T8 was denoted trypsinogen 2 (OMIM- 601564). The third pancreatic cDNA, identified 
independently as trypsinogen 3 (Taui et al., 1990) and 4 (Wiegand eL al., 1993), is distinct from 
the third apparently functional trypsinogen gene (T6) in the TCRB locus but related to the other 

30 pancreatic trypsinogens. Rowen et al. (1 996) stated that the T6 gene is deleted in a common 

inserlionKleleiion polymorphism; if it is fimctional, its function is apparently not essential. Some 
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of the tiypsinogen genes ate expressed in nonpancreatic tissues where their fiinction is unknown. 
Rowen et aL (1 996) noted that the intercalation of the trypsinogen genes in the TCRB locus is 
conserved in mouse and chicken, suggesting shared functional or regulatory constraints, as has 
been postulated for genes in the major histocompatibility complex (such as class 1, U, and Ht 
5 genes) that share similar long-term organizational relationships. 

Rowen et al. (1 996) mapped the gene corresponding to the third pancreatic tiypsinogen 
cDNA by fluorescence in situ hybridization. Hey used a cosmid clone containing 3 trypsinogen 
genes. Strong hybridization to chromosome 7 and weaker hybridization to chromosome 9 were 
observed. They isolated and partially sequenced 4 cosmid clones from the chromosome 9 region. 

10 They found that the region represents a duplication and translocation of a DNA segment from the 
3-prime end of the TCRB locus that includes at least seven V(beta) elements and a functional 
trypsinogen gene denoted T9. The assignment of the PRSS1 gene to 7q35 is established by the 
demonstration of its sequence within the sequence of the 'locus' (OMIM- 186930) for (he T-cell 
receptor beta-chain (Rowen et aL, 1996). It is further supported by the linkage between 

15 microsatellite markers in the 7q35 region and hereditary pancreatitis (OMIM- 167800) and the 
demonstration of mutations in the PRSS1 gene in hereditary pancreatitis. 

Whitcomb et al, (1996) stated that the high degree of DNA sequence homology (more 
than 91%) present among this cluster of five tiypsinogen genes identified by Rowen et al. (1996) 
demanded that highly specific sequence analysis strategies be developed for mutational 

20 screening in families with hereditary pancreatitis. This was necessary to ensure that each 

sequencing run contained only the two alleles corresponding to a single gene, thereby permitting 
detection of helerozygotes in this autosomal dominant disorder, and not a dozen or more alleles 
from multiple related trypsinogen-like genes, which would make detection of heterozygotes 
neatly impossible. In a family with hereditary pancreatitis, Whitcomb et al. (1 996) found that 

25 affected individuals had a single G-to-A transition mutation in the third exon of cationic 
bypsinogen (276000*0001). This mutation was predicted to result in an argl05-to-his 
substitution in the trypsin gene (residue number 1 17 in the more common chymotrypsin number 
system). Subsequently, the same mutation was found in a total of five different hereditary 
pancreatitis kindreds (four from the U.S. and one from Italy) containing a total of 20 affected 

30 individuals and six obligate carriers. The mutation was found in none of the obligate unaffected 
members Oadividuals who married into the family). Subsequent haplotyping revealed that all 
four of the American families displayed the same high risk haplotype over a 4-cM region 
encompassing seven STR markers, confirming the likelihood that these kindreds shared a 
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common ancestor, although no link could be found through eight generations. A fifth family 
from Italy displayed a unique haplotypc indicating that the same mutation had occurred oh at 
least 2 occasions. The G-to-A mutation at codon 1 17 created a novel enzyme recognition site for 
AflHI which provided a facile means to screen for the mutation. As with the obligate unaffected 
5 members of the pancreatitis kindreds, none of 140 controls possessed ihe G-lo-A mutation as 
assayed by the lack of AfUII digestion of the amplified exonic DNA. 

Failure to thrive, nutritional edema, and hypoproteinemia with normal sweat electrolytes 
were features of 2 affected male infants reported by Townes (1965) and Townes et aL (1967), A 
protein hydrolysate diet was beneficial. A male sib of the first patient reported by Townes (1965) 
1 0 had died, apparently of the same condition. Morris and Fisher (1967) reported an affected female 
who also had imperforate anus. The clinical picture in enterokma.se deficiency (OMIM- 226200) 
is closely similar; however, the defect is not in the synthesis of trypsmogen but in the synthesis 
of the enterokinase which activates proteolytic enzymes produced by the pancreas. Oral 
pancrcatin represents a therapeutically successful form of enzyme replacement (Townes, 1972). 

15 Since bercdilary pancreatitis has been mapped rather precisely to 7q35 and since a defect in the 
trypsinogen gene has been identified in hereditary pancreatitis, the assignment of the trypsinogen 
gene can be refined from 7q32-qter to 7q35. 

Ferec et al. (1999) studied 14 families with hereditary pancreatitis and found mutations in 
the PRSS1 gene in 8 mmilies. Ih4 of these families, the mutation (R117H; 276000.0001) had 
20 been described by Whitcomb et al. (1 996). Three novel mutations were described in 4 other 
families. 

Sahin-Toth et al. (1999) studied the roles of the Rl 17H and N21I (276000.0002) 
mutations in hereditary pancreatitis. They stated that the Rl 1 7H mutation is believed to cause 
pancreatitis by eliminating an essential autorytic cleavage site in trypsin, thereby rendering the 

25 protease resistant to inactivation through autolysis. Sahin-Toth et al. (1999) demonstrated that 
the Rl 17H mutation also significantly inhibited autocatalytic trypsinogen breakdown under 
Ca(2+)-free conditions and stabilized the zymogen form of rat trypsin. Taken together with 
findings demonstrating that the N21I mutation stabilized rat trypsinogen against autoactrvation 
and consequent autocatalytic degradation, the observations suggested a unifying molecular 

30 pathomechanisni for hereditary pancreatitis in which zymogen stabilization plays a central iole. 

Uses of the Compositions of the Invention 
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The above defined information for this invention suggests that this Tiypsin -like protein 
may function as a member of a 'Trypsin family". Therefore, the novel nucleic acids and proteins 
identified here may be useful m potential therapeutic applications implicated in (but not limited 
to) various pathologies and disorders as indicated below. The potential therapeutic applications 
5 for this invention include, but are not limited to: protein therapeutic, small molecule drug target, 
antibody target (therapeutic, diagnostic, drug targetmg/cytotoxic antibody), diagnostic and/or 
prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration 
in vivo and in vitro of all tissues and cell types composing (but not limited to) those defined here. 
The nucleic acids and proteins of the invention axe useful in potential therapeutic 

10 applications implicated in failure to thrive, nutritional edema, and hypoproteinemia, trypsinogen 
deficiency disease, chronic and heriditary pancreatitis, enterkinase defieciency, cancer and/or 
related pathologies and disorders and/or other pathologies and disorders. For example, a cDNA 
encoding the Tr>psin-like protein may be useful in gene therapy, and the Trypsin-like protein 
may be useful when administered to a subject in need thereof. By way of nonlimitiiig example, 

1 5 the compositions of the present invention will have efficacy for treatment of patients suffering 
from failure Id thrive, nutritional edema, and hypoproteinemia, trypsinogen deficiency disease, 
chronic and heriditary pancreatitis, enterkinase defieciency, cancer. The novel nucleic arid 
encoding Trypsin-like protein, and the Trypsin-like protein of the invention, or fragments 
thereo f, may further he useful in diagnostic applications, wherein the presence or amount of the 

20 nucleic acid or die protein are to be assessed. These materials are further useftl in the generation 
of antibodies that bind immunospeuifieally to the novel substances of the invention for use in 
therapeutic or diagnostic methods. 

MOL7 

25 A novel nucleic acid encoding a KaUikrein-Uke-protein MOL7 was identified by TblastN 

using CuraGen Corporation's sequence file tor MOL7 probes or homoiogs, and run against die 
Genomic Dairy Files made available by GenBank. The nucleic acid was further predicted by the 
program GenScan™, including selection of exons. These were further modified by means of 
similarities using BLAST searches. The sequences were then manually corrected for apparent 

30 inconsistencies, thereby obtaining the sequences encoding the full-length protein. The disclosed 
novel MOL7 nucleic acid of 1 81 1 nucleotides (also referred to as 30675745.0.499) is shown in 
Table 7 A. An open reading frame begins with an ATG initiation codon at nucleotides 368-370 
and ends with a TAG codon at nucleotides 1 553-1 555. A putative untranslated region upstream 
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from the initiation codon and downstream from the termination codon are underlined in Table 
7 A, and the start and stop codons are in bold letters. 



Table 7A* MOL7 Nucleotide Sequence (SEQ ID NO:15) 

ACAAATCCTTCTGTTGAACTCTACT3TGrCAGGCCAGCCT3fl,GTTCATTTCT CCTTGRj CAGGPJ\CAGTT 

TTATGGCT ACAG AG T TAAGGC AAGS3T T GAAT T CCACG AG T C AAAAAGCAG C CC T T T T 2AGAG AC CC AAG 
TCTCTGGGGTGC^CAGGGGC-TGGS CTGGATTGAGAAGAA AACTGACAAGAGTAAGCT3CCCTCTCTrCT 
CTGGCCATCTCACAAACCACAGTGC3GGCCAACTGGTCCTGgCTCTTTACCACACAGAACCAAGCACTAG 
GGATAAGACAGCTGCCC ATGGTG?CCX;CGCCGGG 

GGTGCTGCTCGC-CCTTCTC"AXTCTTCCACCAG?TGTGGCGTCCAGAAAGCTTCGGTTTTCTACGGTCC^ 

GACCCCAAGGA<^GCTTGG2CAGCA'3CATGGAG n TCCraTGGGlGGTGTCGCT^ 

CACACXTSGCTTTCGGCTGCaTCCTGAGCG^^ 

GAAGSACATTGTCGTTATAGTGGGTATAAGTAACATGGATCCTAGCAAGAT'TGCrCAa^CZlGAGTATCCA 

GTCAATACCATCATC^TCCATGAGGACTTTGATAACAACTCCATGAGCAACAACATAGCCCTCCTGAAGA 

CAGACAC AGCGAT GCATTT T GGCAACCT GG T CCAGT CGA T CTGC T T CCTC GGCAG A A T i jCTGC ATAC ACC 

ACCAGTC"TGCAGAACTGCT3GGTGTCAGGATGG/iATCCCACATCTGCAACAGGAAATCACA"GACGATR 

AGT G I" CC _ GAG G AAAATCT TCGT GAAAGATCT TGACATG TG TCC CC T A- AC AAAC TCCAG AAGACAGAAT 

GCGGCAGCCACACGAAAGAGGAAACCAAGACTGC^^ 

GCTAC&GCAGTTCGATCTGTGSGTTCrGAGAGGAAT^^^ 

TTTCrGTACACCAAGGlGGAAGACTACAGCAAATGGAICACATCCAAGGCTGAGAGGGCCGGCCCTCCCC 
TGT CCTCACTCCACCACTGGGAAAAGTTGATCTCTTTCT CCCACCATGSACCAAAT GCCGCCATGACACA 
GA/iGACATATTCrGArrCTGAACTGGGCCATGTTGGATCATACTTGCASGGACAAAGAAGGACCATCACG 
CATrCAC<^TAGGAAAC&GCTCTAGAGATAGT^^ 

GGTCTCCTGAGGCGTCTGTACAACCCTl^TACTATGACTATTAC^G^GGGGAGGTGGGGGAAGGrAGGAT 
TTTT GCAGGTCAGAACAGGTTGTAT DAC^CCCGARGA & ArCATCTTGGTTTCCTTCGTGCTTGT T?TCTT T 
T G C AGCAGT AX CTAGTC C AG GAG C T ACCCCACC AAAC T GAAGAGT AAACTG AG AA T GCTGAGT GCCA G GC 
AT TCACCAT GCTGTTTTGATGTCTGTTTTTG AT AGT I GCACACTGGGGCTGCCACGGATAAGCCCAT GGC 
ATACACTGGGCTGGCTCTCCCTCCTCTATCCCTCTCCCAGGTGTGGG&AGGrCACTTTCACTATGCrrTGT 
GAAC TRAAT GCTGGCTAACAAGTGTCAAA A A A Afi A A ft A AAAAAAAAAAAAAAAAAAAAAAA 



The MOL7 protein encoded by SEQ ID NO: 15 has 395 amino acid residues, and is 
presented using the one-letter code in Table 7B (SEQ ID NO: 16). The SignalP, Psort and/or 
Hydropathy profile for MOL7 predict that MOL7 has a signal peptide and is likely to be 
localized at the plasma membrane with a certainty of 0. 9190. The SignalP shows a signal 
10 sequence is coded foT in the first 44 amino acids with the most likely cleavage site being between 
amino acids 30 and 3 1 . This is typical of this type of membrane protein. The molecular weight 
of MOL7 is 438 1 5.7 Daltons. 



Table 7B. Encoded MOL7 protein sequence (SEQ ID NO:16). 

EFWVLSIASA:QNRKDIWI VGI SNMDPSKI AHTEYPVUn I IHEDPDNKSMSHMI ALliKTCTAMH FGNLVQ3 1 
C FVXDLDMCPL YKLQKTE CCWHTKliiSI' KTAC iX*DPG 

S PMMCQl*QQFJDLWVIi RGIUfFGGB TCPGLFL YTKVED YS ECN I TS KAERAGPPLSSL3HWEKL I S F SHHGPNAAM 
TQKTYSDSBljGff/aS YLQC3QHHTI THKRLGNSSRD SLDVRE KDVKESG3SP EASVQP LYYDY YGGEVGEGRJ FA 
GQM3LYQPEE I ILVSPVLVPFCSS I 
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MOL7 was found to have 290 of 290 amino acid residues (100 %) identical to Hie 290 
amino acid residue hypothetical 32.6 kD protein from Homo sapiens (human) 
(ACC:CAB70765). This protein has similarity to kallikrein. 

MOL7 shows significant homologies to human hypothetical 32.6 kD protein (a protein 
5 with h'imiliarilies to Kallikrcm), as described in, but not limited to, the references below. 

Kallikreins are a subgroup of serine proteases and these proteolytic enzymes have diverse 
physiological functions in many tissues. Growing evidence suggests that many kallikreins are 
implicated in carcinogenesis. The human kallikrein gene family is localized on chromosome 
19ql3.3-ql3.4 and currently includes three members: KLK1 or pancreatic/renal kallikrein, 
10 XLK2 or human glandular kallikrein and KLK3 or prostate-specific antigen (PSA). The latter 
two genes are almost prostate-specific and they arc used for diagnosis and monitoring of prostate 
cancer and more recently, in breast cancer applications (Yousef a/., Anticancer Res 1999 Jul- 
Aug;19(4B):2843-52). These new genes, like the already known kallikreins, may have utility for 
diagnosis, monitoring and therapeutics of various cancers including those of the breast, prostate 
15 and testis. 

Monsees etaL, Immunophaimacology 1999 Dec;45(l -3): 107-14, found that elements of 
the kallikrein-kinin system are present in rat seminiferous epithelium. Peptide hormones are 
involved in the paracrine regulation of several physiological processes. The paracrine peptide 
system may play a role in die regulation of Sertoli cell function or in the Sertoli eell-germ cell 

20 crosstalk, and therefore, be involved in mammalian reproduction, especially spermatogenesis. 

Chen et al 9 J Biol Chem 1996 Nov l;271(44):27590-4, found that the kallikrein-kinin 
system participates in blood pressure regulation. One of the kalKkrein-kmin system components, 
kallikrein-binding protein, binds to tissue kallikrein and inhibits its activity in vitro. 

The glandular kallikrems are a distinct group of serine proteases with a molecular weight 

25 of 25,000-40,000 and an ability to release vasoactive peptides from kininogen in vi(ro> although 
die kininogenase activity of different kallikrems is highly variable. Ihe true physiologic role of 
specific kallikreins is often unrelated to the kininogenase activity. In the mouse a major site of 
kallikrein synthesis is the male submaxillary gland. Glandular kallikreins are also synthesized in 
the pancreas and kidney. The several kallikreins found in this tissue include epidermal growth 

30 factor binding protein (EGF-BP) and the gamma subunit of nerve growth factor (NGFG; 
162040) which are responsible for the processing of EGF (131530) and NGT (162030), 
respectively. Although EGF-BP and NGFG exhibit strict substrate specificity, they share 
extensive amino acid sequence homology and immunologic crossreactivity. Mason et al. (1983) 
concluded that die glandular kallikrein gene family comprises 25-30 highly homologous genes 
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that encode specific proteases involved in the processing of biologically active peptides. All are 
closely linked on mouse cliromosome 7 (assignment by Chinese hamster-mouse hybrid cell 
studies). Several human kallikrein genes have been isolated. 

Schedlich et al (1 987) described a human glandular prepTokallikrein gene, hGK-1, 
5 isolated from a human genomic library. The 5.2-kb gene encodes a prepropeptide of 261 amino 
acids. The mature protein is 237 amino acids long and has 66% homology with the sequence 
predicted for the human kallikrein synthesized in pancreas, kidney, and salivary gland (KLK1; 
147910). Seventy-three percent homology with human prostate-specific antigen (APS: 176820) 
was observed. Expression of the glandular kallikrein gene, like that of the APS gene, seems to be 

10 restricted to the prostate. Riegman et al. (1 989) found that the glandular kallikrein gene and that 
for prostate-specific antigen are aligned in a head-to-tail orientation and are separated by about 
1 2 kb. Southern bl ot analysis of DNA from a panel of human-hamster hybrid cells showed that 
the genes are situated on chromosome 1 9. Since the KLK1 gene is also on chromosome 19, these 
3 genes probably represent a cluster. From in situ hybridization studies, Qin et al, (1991) 

1 5 concluded that the glandular kallikrein gene and probably other kal likrein genes are located in 
q 1 3 ,3 and ql 3 A bands of chrornosoirie 1 9 and are probably near the border of these tvvo bands . 

Therapeutic applications 

20 The expression pattern, and protein similarity information for MOL7 suggest that it may 

function as human Katlikrem-like protein. Therefore, the nucleic acid and protefn of the 
invention are useful in potential therapeutic applications implicated, for example but not limited 
to, various cancers including those of the testis, prostate, and breast; mammalian reproduction, 
especially spermatogenesis; blood pressure regulation; and other diseases and disorders. The 

25 homology to antigenic secreted and membrane proteins suggests that antibodies directed against 
the novel genes may be useful in treatment and prevention of various cancers including those of 
the testis, prostate, and breast; mammalian reproduction, especially spermatogenesis; blood 
pressure regulation; and other diseases and disorders. 

Potential therapeutic uses for the inventions) are, for example but not limited to, the 

30 following: (i) Protein therapeutic, (ii) small molecule drug target, (Hi) antibody target 

(therapeutic, diagnostic, drug targeting/cyto toxic antibody), (iv) diagnostic and/or prognostic 
marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) tissue 
regeneration in vitro and in vivo (regeneration for all these tissues and cell types composing these 
tissues and cell types derived from these tissues. 
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The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in various cancers including those of the testis, prostate, and breast; 
mammalian reproduction, especially spermatogenesis; blood pressure regulation;and other 
diseases and disorders. For example, but not limited to. a cDNA encoding the novel human 
5 plasma membrane protein may be useful in gene therapy, and the novel human plasma 

membrane protein may be useful when administered to a subject in need thereof By way of 
non-limiting example, the compositions of the present invention will have efficacy for treatment 
of patients suffering from, for example, but not limited to, various cancers including those of the 
testis, prostate, and breast; mammalian reproduction, especially spermatogenesis; blood pressure 

10 regulation; and other diseases and disorders. The novel nucleic acid encoding die novel human 
plasma membrane protein, and the novel human plasma membrane protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of fee nucleic acid or the protein are to be assessed. These materials are further useful in 
the generation of antibodies that bind immunospecifically to the novel substances of the 

1 5 invention for use in therapeutic or diagnostic methods. 

MOL8 

MOL8a 

A novel human Acetyl LDL Receptor-like nucleic acid was identified by TblastN using 
20 CuraGen Corporation's sequence file for MOL probes or homologs and run against the Genomic 
Daily Files made available by GenBank. The nucleic acid was further predicted by the program 
GenScan™, including selection of exons. These were further modified by means of similarities 
using BLAST searches. The sequences were then manually corrected for apparent 
inconsistencies, thereby obtaining the sequences encoding the full-length protein. The disclosed 
25 novel MOL8a nucleic acid of 980 nucleotides (also referred to as 1 1 800699-0-1 6) is shown in 
Table 8B. An open reading frame begins with an ATG initiation codon at nucleotides 1-3 and 
ends with a TGA cudon at nucleotides 2803-2805. A putative untranslated region downstream 
from the termination codon are underlined in Table 8A, and the start and stop codons are hi bold 
letters. 

30 

Table 8A. MOL8a Nucleotide Sequence (SEQ ID NO:17) 

C GTCSCrGCT GC? SCTG 2 TGCT GCTCT GGATGCTGCC GGAC ACC GTGGSG^CT CAGGAACT GAACCCTC GCGGCCG 
CAACGTGTGCCG? SCTC 2CGG Z T CCCAGGT GCC CACGT GCT GCGCTGGCTG^ AGGC AGC AAGGGGACG AGTGT GGG 
ATTGC!GffT GT GC GAAGG 3AACT CCACGTGCT GA3AGAACGAGG? GlfiCSTGAGG CCTGG CGAGTGCCGC T GC CGCC 
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ACGGCTACTTCGUTGCCAACltiUC^t^^ 

G AGAGAT^GGCXIS GGGT CCAAGGTG GGG CG GGGTCGGATCCGCCTTCGGGGCGGGTCCCC AGAGG T3GCGGCT GGA 

GTGCSGGACGCG 3CK33^I^TTCC GGC TGGC"GGGG3CACCTAC?CAA6^ACCGGGRCCTTCC A CCC GCTCCGCT CCT 

C CCCIGCAGAG? GCCC GCGCCAG^ T CTGCG C CCCCGACTGCAAG GAGCT GTGTAGCT GCC AC CC ACACGGGCAGT G 

CGAG3ACMGACAGGCCACTGTACTT<r i eA 

CAOS^ACGTGCCACCeCCGGAGCGGC^GTGCCGC^^ 

GCTACTTGaAGCGCCACSTOSC&ClGCGACCraC^G/^^ 

CTGaWMC^GTGCSCCTGCAACTCGTCTCCCTGCG^ 

rTCG^GCGCGCTGCGATaSCTJKTGCTAGTGCTC 

AGCCSGGCTACCSCGGCAAGTACIGTCGCI^GCCG'I'GCCXZCGCCGGCITC^ 

TGGC OAGT GC AASGGC CAGCAGCCG T GC ACGGTGGCCOAGG GCCGCT GC TTG ACGTGCGAGCCC&3CTGGAACGGA 
AC C ARGTGCGACCAGCCTTGCGCC AC CGGTCT C TAT GGCGAGGGCTGCAGCC ACCGCTGTCCGCCAT3CCG CGAC G 
GGCAIX^TGTAACCATGTCACCC^CAAGT^ 

GT GTAGCA AT GGC ACTT ACGGC GAGGACTGCGCCTTCGTCI GCGCCGACTGCGGCAGCG GACACT GC3ACTT CCAG 
TCGGSGCdCTGCCTGTG CAGCCCTGGCGTCC ACGGGCC<XAC^GT AAC G 
CGGACTG^C^CAGGCCTGCAGCl'GCCACGAGGACAC^xC-CGACCCGGr 

CC AGCGCAAGGGCGTG AT G3^GC GCGG GCGCGCT GCT CGTCCTGCT CGT CT GCCTGCT GC T C TCdCTGZ TCGGC TGC 

GCGG 3CGCT T GAGT CGC ATCAGCAT GAAGC T GCCCC GGATC CCGC?CCGGAGGCAGAAACT AOC CAAAGTC GTAGT 
G GCC CACXLB£GACCTGGATAA£^£ACTCAACTGC AGCTTCC TGGAGCCACCCTCAGGGCTGGAGC AC CCCT CACCA 
T CCT GGTCCTCCS GGGC CrCCTTCTCCTCGTTTGAC ACCAC TGATGAAG GCCCTGTGT ACT GTGTACCCGATGAGG 
AGGCACCAGCGG^GAG^CGGGACCCCGAAGTCCCCACT 
GACCi^GCCAGCCTCXJ^CGAGGAGG^A^ACCCCTlXCCGCGTCCT 

G AGG 3GGCCGGA3GGGCTCT3TACG CGCGC GT GGCC CGACG CGAGGCCC ©3CCGGCCCGGGCCC GGGGCG A3ATT G 
GGGGCCTGTCGCTGTCSCCATCGCCCXiAGCGCAGSAAACXGCC^^ 

C T GGATOCAC GGCAAGCACAGCGCCGCT GC AGCrGGCCGlGCGCCCTCAD^ACCGCCGCCAGGCT CC 3AGGCC GC G 

C C CA3CCCCAGCAAGA3GAAACGGAC GCCC AGCGAC AAftlGGGCGCATAQSGTCGAAC ACGGCAGCCCCCG3ACCC 

GCGACXICAACGCCGCGCCCCCCGC^GCTGCCCGA^^ 

AGCG330GCCGCGCCCCSGCCTCTTGGAGCCGA^ 

ATGTTGGCCGCTGA^GCGCGGC^GACTrcCAGCCr^ 

GGGA<U^CCGGCGC£CCraCAGAAAGCCAAGCGeTrc^ 

C GAAACCCCGGGGCCTG AGAAGGCGG<X5ACCGACTT 3CCCGCGCC TGAGACC CCC CGGAAG AAGACC Z CCA1 C GAG 

aagcc&<x:gcgcaagaagagccgcaggcgcxx^^ 



The MOL80 protein encoded by SEQ ID NO: 16 has 324 amino acid residues, and is 
presented using the one-letter code in Table 8B (SEQ ID NO: 1 8). The SignalP> Psort and'or 
Hydropathy profile for MOL8a predict that MOL8a has a signal peptide and is likely to be 
5 localized at the plasma membrane with a certainty of 0.6000. The SignatP shows a signal 
sequence with a clcavagp site between amino acids 43 and 44. This is typical of this type of 
membrane protein. Therefore it is likely that this novel human plasma membrane protein is 
available at the appropriate sub-cellular localization and hence accessible for the therapeutic uses 
described in this application. 

10 



Table 8B. Encoded MOL8a protein sequence (SEQ ID NO:18). 



MEGAGPRGAGPARRRGAGGP5S PLLPSLUXI LLPfMLPDTVAP^lIiNPRGRNVCR APG5QVPTCCAGKIRQQGDECG 
I AVCEGNS TCS ENEVC VRPGE CRC RHGYFGAN CD T S E RG VGPVL-VGGAES WRDGAGS K VGRGRIRLRGG S PEVAAG 
VRDAGRFRIAGoT YS H TGAFH PL CISS rAECPRQ FPfGriXJKELC SCHU HGQCE DVTGQCTCHARR WGARCE EACQ CQ 
HGT CHPRS G ACRCES GWW3 AQC AS AC YCS AT S RCD E QTG ACLCHAGWSGR SCtlttQC ACtf SS PCEQQSGRCQCRERT 
FGARC DRYCQCFRGRC H P VDGTCACE PGYRGK YCRI PCP AG FYGLGCRRRC GQCKG QQPCTVAE GRCL"CE PG3TOG 
T KCDC PCAT GFYGEG CG IIRCP PCR DGIlACNllVTGKCTRCNA3?£t GERCET KCSNGTYGE DCAFVC ADCGBGI IC DPQ 
SGRC1CS P GVHGPHCK V?CP PGUiGADCAQAC SCH2DTC DPVI GACHLET N QRKGVMGAGALLVLLVCLLLSLLGC 
CCACRGKD ?TRRE L£ LGRKKAPIIRLCGRPS RI 5MKL PRI PLRRQKL PKW VAHH DL DNTLNC S FLE P PS GLSQP 3 P 
SWS S RAS FS S FDT ^DBG P V YC VPKE EAPAESR C PEVPTVPAEAPA PS P VPL T T PA3 AEEAI PL FAS S DS ERSAS 3 V 
EGPGGALYARVARREARPARARGE IGGLSL5 PSPERRKP?PPD?ATKPKVS WIHGKHS AAAAGRAPSPPPPGSEAA 
PS P S KRKRT P S DKSAF. TVE KG S PRT RDPT PRP PGL PEEATAL AAPS P PRARARAAP RPLGAHGRRR S PAKRAE AAS 
MLAAD VRG KTRSLGRABVALGAQG PRE KP AP PCKAKR S VPPAS PARAP PATET PG P EK?AT DLJAPET PRKKT P IQ 
KPPRKKSREAA3ELGFAGAPT:j 
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The fiill amino acid sequence of the protein of the invention was found to have 576/729 
(79%) identical and 596/729 (SL%) similarity to a murine nurse cell receptor amino acid 
sequence (PatP Accession No. Y856 J 6). Hie foil amino acid sequence of the protein of the 
5 invention was also found to have 296 of 741 amino acid residues (39 %) identical and 383 of 74 1 
amino acid residues (51 %) homolog to the 830 amino acid residue acetyl LDL receptor 
precursor horn Homo sapiens (human) (ACC;O43701). 

MOL8a is expressed in the following tissues: fetal thymus, mammary gland, fetal 
thymus, pool of ten tissues (adrenal, mammary, prostate, testis, uterus, bone marrow*, 
10 melanoma*, pituitary*, thyroid*, spleen) (*firom mRNA rather than from total RNA). 



MOL8b 

A novel nucleic acid was identified by laboratory cloning of cDNA fragments, by in 
silico prediction of the sequence, cDNA fragments covering either the full length of the DNA 

1 5 sequence, or part of the sequence, or bou% were cloned. In silico prediction was based on 
sequences available in Curagetfs proprietary sequence databases or in the public human 
sequence databases, and provided either the full length DNA sequence, or some portion thereof. 
These were further modified by means of similarities using BLAST searches. The sequences 
were then manually corrected for apparent inconsistencies, thereby obtaining the sequences 

20 encoding the full-length protein. The disclosed novel MOL8b nucleic acid of 2593 nucleotides 
(also referred to as CG508S9-02) is shown in Table 8C. An open reading frame begins with an 
ATG initiation codon at nucleotides 1-3 and ends with a TAG codon at nucleotides 2596-259$. 
A putative untranslated region upstream from the initiation codon and downstream from the 
termination codon are underlined in Table SD, and the start and stop codons are in bold letters. 

25 



Table 8D. MOLSb Nucleotide Sequence (SEQ ID NO:19) 



ATGG AG GGCGC?.GGGCCCCGGGGGGCCGGGC02GCGCG 3CGC CG GGGAGCCG GGGGGCC GCCGT CACCGCT G C?GC 
CGTC SC TGCT 3-CT GCTGCTGCT GOT CTGGATGCTGCCGGAC AC C GT G GCGCCTC AG GAACTGAACCCTCG G GGCCG 
CAACSTGtGCCGTGCTCCXX^ 

AT T3 CX5GTGT 3CGAAGGCAACTCC ACG? GCTC A3AGAACG AGGTGTGCGTG AGG CCTG SCGAGTGCCG CTSCCG CC 
ACGGCTACTT CGGTGC CAACTGCG AC AC CAAGT GCCCG C GCCAGTT C TGGGG CC C C GACT GCAAGGAG CTGT GTAG 
CTGCCACCCACACGGGCAGTGCG AGGAC GTGACAGGCCAGTGT ACT TGTCACGC GC GG^CTGGGGCG CGCGCT G C 
GAGC AT G C(7TGC CAGT GCCAGCAC GGCACGT GCCT^CCC GC GGAGCG GC GCGT GCCGCTGTGAGCCXGGCTGfiTGSG 
G<^GCAGIX3CC^CAGCGCGTGCT^^ 

CGC ASG CI GGT GGGGCCGC AGCT GCAAC AACCASTGCGC CTG CAACT C 5TCT CCCTGCGAGCAGCAGAGCG G CCGC 
TGTC AGTGCC GC GAGCGTACGT T CGGCG CGCGCXGCGA T CGCT ACTG CCAGT GCTr<X3CGGCCGCTGCC ACCCTG 
TG3ACGGCAC GTGTGC CTGCGAGCCGG G CT ACCGCGGC AAGT AC TGT CGCGAGC CGT G CCCCGCCGGCTTCT AC SG 
CI 1 i'3 GGCTGTCGCGC5CCGGTGTGG CCAG TGC AAGGGCC AGC AGG CG r GC ACGGT 3GCC SAGGGCCGCTGC TTGACG 
TGG3AGCCCGGC TGGA ACGGAACC AAGT GCGAGCAGCC T T GCGCCACCGGTT TC T ATGGCGAGGGCTGC AGCCACC 
GCT3TCCGCC ATGCCGCGACGGGC A'JGC CTGT AACCAT Gl C ACCGG CAAGT GT ACGCGCTGC&ACXaCG GGCT GGAT 
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AGC GGAC AC T G CGACTT CCAGTCGGGGCGCTGCCTGT GCAGCCCTGGCGT CCACGGG CCCCACTGTAAC GTCACGT 
GCCCUXCCGG^CTCCACGGCC^GGACTGTGCTCAGGr^TGCJiCOT 
TGCC^^CCTAGAAACCAACCAGCGC^GGGCG^ 
CTGCTCTOGCIGCTCGGCTGCTGCTGCGCTTGCrcC^^ 

AGAAG GCGCCGC AC CGACT ATGCGGGCGCT T CAGT CGCATCAGCA7 GAA3CTGCC CCG 3ATC CCGCTCCG GAGGC A 

GAAACTAOCCTJ^AGTCGTAGl^CCCACCACGACCTGGATAACAC^ 

GGGCTGG&GC£C£CCTCACrATCCTGGTCCTCT^ 

TGTACT GTGTACCCC AT GAGGAGGCACCAG CGG AGAGCCGGGACCC C GAAGTCCC CACTGTC ~C?GCCGAGGCGCC 
GGCCCC GTCCCCTGTGC CCT^GACCACGCC AGCCT CCGCCGAGGAGGCG ATACCC CT C CC CG CGTCCT CCSACAG C 
GAGCGGTCCXiCGTCCAGCGTGGAGGGGCCCGGAGG 

C CCGG GCCC GC-GGC GAGATTGGGGGCCrGT CGCTGTCGC CATCGCC C GAGCX5CAGGAAACCGCXGCCACC T GACC C 
CG CCACCAACCCTAACC rCTCCTGGATCC ACGGa^AGCACAGCGCC GCTGCAGCT GGC CGTGCGOCCTCACC ACCG 
CEGC^GCCTCrcAGGCCGO^C^^ 

AAC AC GGCAGCCC CCGGACCCGCGACCCAACGC CGCGC CCCCCGG GGCT GCCCGAGG AGGCGAC AGCCCT CGC7GC 
C C COT C CCCCCCC AG GG CCCGACCCCCCC CCCCCCCCC CCCCTCTT G CAo CCCACGGAC GCC GGCGGTCCCCCGCG 
AAG CGC GOCG AGGCTGCCTCCATGT T GGCCGCT GACGfr GCG CGGC AAGACTCGC AGCC T GGGCCGCGCCGAGG2GG 
CCCTGGG<XCGCAG3GCCCCAGGGAAAAGCC^ 

CGCCCGCGCGCCC CC AGCGACOT AAACCCCGGGGCC^ AGAAGGCGGCG ACXX& C 
CGGAAGAAGACCC CC AT CCAGAAGCCGCCGCGC A a JGAAGAG CCGG 3AGGCGGOGGGCGAGCTGGGCAGGGCGGGC G 
CACCCACOCTGTAG : 



The disclosed nucleic acid sequence has 1311 of 2041 bases (64%) identical to a 
gb:GENBANK-lD:D86864|accO)86854.1 mRNA from Homo sapiens (Homo sapiens mRNA for 
acetyl LDL receptor, complete cds) OB value = 3.2e- 98 ). 

5 The MOLSb protein encoded by SEQ ID NO: 17 has 865 amino acid residues, and is 

presented using the nne-leticT code in Table 8E (SEQ ID NO:20). The SignalP, Psort and/or 
Hydropathy profile for MOL8a predict that MOL8a has a signal peptide and is likely to be 
localized at the plasma membrane with a certainty of 0.6000. The Signal? shows a signal 
sequence with a cleavage site between amino acids 43 and 44. This is typical of this type of 

10 membrane protein. Therefore it is likely that this novel human plasma membrane protein is 

available at the appropriate sub-cellular localization and hence accessible for the therapeutic uses 
described in this application. 



Table 8E. Encoded MOL8b protein sequence (SEQ CD NO;20). 

Hfcl GAG V RGAG W\KRRG AGG P ?3 ?LlPST,TJJJJ>I,hmPDTVA¥QEL 

IAVCEGNSTCSENEVCVRPGECRCRnGYF^^ 

BEACCKQHGTaiPRSGACRCEP^Sl^ 

C QCKB KT fGARCDR Y C QC If KG RCH PV DGT CACE PG Y3G K YCRE PC PAG FYGliGCRRRCGQCKGQQ PCTVA3GRCLT 
CB PGWXGTKCIXiPCATGFYGBGCSHRCPPCRIXiIlACNir/rC^CTRCI^Gif IGDRCBTKCSWGT YGB DCAFVCADCG 
3 G3 CD FQS GRC1C S ?GVRGPHC^/TCPPGLHGADCA0AC5CH EDTCDPVT GACHLETN QRKGVM GAG AT J. VT J , VCb 
LLSLItfCCa^C^GKDPTRRIU^LGRKK^^ 
Gr>3QPSPSW9SRA9?3SFIOTDGGm<^ 

B RS A3 5 VEG PGO^LYARV ARREAR PARARGE IGGLSL3 PS PE RRKP P P P D ? ATKPKVS W IHGKH3 AAAAGRAPS P? 
P P GSEAAP 5 PS KRKRT PS DKS?iRTVEKGSPF/rPJ>PrPRPE^PEfcATAl^PSP PA 
KRAEAASMI/AADVRGKTRSLGRAE VAL GAQGPRE K?AP PQKRKR3 VP PAS P ARAPP AT ET PG P3 KAATDL PAPET P 
RKKT P I QKPPRKKS REAAGELG RAGAP T L 



The full amino acid sequence of the protein of the invention was found to have 340 of 
15 823 amino acid residues (41%) identical to, and 443 of 823 amino acid residues (53%) similar to, 
the 830 ammo acid residue ptnr:SPTREMBl^A(X:O43701 protein from Homo sapiens 
(Human) (ACETYL LDL RECEPTOR PRECURSOR) (E value = 9.0e- 1<54 ). 
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Homology between M0L8a, 8b and the human acetyl LDL receptor are presented 
graphically in the multiple sequence alignment given in Table 8F (with MOL8a being shown on 
line I, and MOL8b being shown on line 2) as a ClustalW analysis comparing MOL8 with related 
protein sequences. 



Table 8F. Information for the ClustalW proteins: 

1) HOL8a (SEQ ID NO: IB) 

2) HOL8b (SEO ID NO:2C) 

3) gi 1 4507203 1 ref]NP_0C3684 .11 acetyl LDL receptor; saEOsravenger receptor 
expressed by endothelial cells [Ror«o sapiens] (SEQ id NO: 57) 




M3L8a prot 
MDL8b Profc 
MP 003684 



MDL8& Profc 
MOL8b Vrot 
O03fift4 



297 
22$ 
207 
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15 



MOL&> Prot 
NP 003684 



610 



19 hM 



W&Mn l i 525" 
aPEHPeHK 507 




MOIifia PrciL 
MOLSb Prot 
NP 0036B4 



.E gPPEWSlgr ATPKfg— §L fl{RA55Rg3-VS |flRCTKFAPQ SRgSSgg GOB 





VEME33V QESSGPVTTI YHLAGKflRG& BGPVgSV^5l 
970 • 



360 
..I . 



880 

. • I . 



|. 



890 



AADVBGKIF 
AADVRSKW 



LSAQGPREKP 
LSA0G?REEP 



EKP PRQALMtfKKG SPCiJASGSfflG QSP23SAPKAG 




910 



920 
.)-... I. 



930 
.1 



940 



950 



HOLBa Prot 
MOLBb Prot 
MP 003684 



934 
B65 
830 



Chromosomal information: 

MOL8 maps to chromosome 22ql 1 . This assignment was made using mapping 
information associated with genomic clones, public genes and ESTs sharing sequence identity 
with the disclosed sequence and CuraGen Corporation's Electronic Northern bioinfonnatic tool. 

Tissue expression 

MOL8 is expressed in at least the following tissues: kidney, senescent fibroblasts, 
lymphocyte, B cell, and germ cell tumors. Expression information was derived from the tissue 
sources of the sequences that were included in the derivation of the sequence of CuraGen Acc. 
No, CG50889-02. 

MOL8 shows significant homologies to human LDL Receptor-like protein, as described 
in, but not limited to, the references below. Hypercholesterolemia is an autosomal dominant 
disorder characterized by elevation of scrum cholesterol bound to low density lipoprotein (LDL). 
Mutations in the LDL receptor (LDLR) gene on chromosome 19 cause this disorder. Familial 
hypercholesterolemia is characterized by elevation of serum cholesterol bound to low density 
lipoprotein (LDL) and is, hence, one of the conditions producing the hyperiipormjtememia H 
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phenotype (see OMIM 144400). Heterozygotes develop tendinous xanthomas, corneal arcus, and 
coronary artery disease; the last usually becomes evident in the fourth or fifth decade. 
Homozygotes develop these features at a accelerated rats m addition to planar xanthomas, which 
may be evident at birth in the web between the first two digits. 
5 Hepatitis C virus (HCV)» the principal viral cause of chronic hepatitis, is not readily 

replicated in cell culture systems, making it difficult to ascertain information on cell receptors for 
the virus. However, several observations from studies on the role of HCV in mixed 
cryoglobulinemia provided some insight into HCV entry into cells. Evidence indicated that HCV 
and other viruses enter cells through the mediation of LDL receptors: by the demonstration that 

10 endocytosis of these viruses correlates with LDL receptor activity, by complete inhibition of 
detectable endocytosis by anti-LDL receptor antibody, by inhibition with anti-apolipopiotein E 
and anti-apolipoprotein B antibodies, by chemical methods abrogating hpoprotcin/LDL receptor 
interactions, and by inhibition with the endocytosis inhibitor phenylatsine oxide. Agnello et al. 
(1999) provided confirmatory evidence by the lack of detectable LDL receptor on cells known to 

15 be resistant to infection by one of these viruses, bovine viral diarrheal virus (BVDV). 

Endocytosis via the LDL receptor was shown to be mediated by complexing of the vims to very 
low density lipoprotein (VLDL) or LDL, but not high density lipoprotein (HDL). Studies using 
LDL receptor-deficient ceils or a cytolytic BVDV system indicated that the LDL receptor may 
be the main hut not exclusive means of cell entry of these viruses. 

20 Therapeutic uses of the composition 

The expression pattern, and protein similarity information for MOL8 that it may function 
as human LDL Receptor-Kke protein. Therefore, the nucleic acid and protein of the invention aie 
useful in potential therapeutic applications implicated, for example but not limited to, metabolic 
disorders, e.g. Hypercholesterolemia, viral diseases, and other diseases and disorders. The 

25 homology to antigenic secreted and membrane proteins suggests that antibodies directed against 
the novel genes may be useful in treatment and prevention of metabolic disorders, e.g. 
Hypercholesterolemia, viral diseases, and other diseases and disorders. 

Potential therapeutic uses for the invention(s) are, for example but not limited to, the 
following: (i) Protein therapeutic, (ii) email molecule drug target, (m) antibody target 

30 (therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic and/or prognostic 
marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) tissue 
regeneration in vitro and in vivo (regeneration for all these tissues and cell types composing these 
tissues and cell types derived from these tissues. 
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The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in various cancers including those metabolic disorders, e,g. 
Hypercholesterolemia, viral diseases, and other diseases and disorders. For example, but not 
limited to, a cDNA encoding the novel human plasma membrane protein may be useful in gene 
5 therapy, and Lhe novel human plasma membrane protein may be useful when administered Lo a 
subject in need thereof. By way of non-limiting example, the compositions of the present 
invention will have efficacy for treatment of patients suffering from, for example, but not limited 
to, various cancers including those of the metabolic disorders, e.g. Hypercholesterolemia, viral 
diseases, and other diseases and disorders. The novel nucleic acid encoding the novel human 

10 plasma membrane protein, and the novel human plasma membrane protein of lhe invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. These materials are further useful in 
the generation of antibodies that bind immunospecifically to the novel substances of the 
invention for use in therapeutic or diagnostic methods. 

15 These materials are further useful in-the generation of antibodies that bind immuno- 

specifically to the novel MOL8 substances for use in therapeutic or diagnostic methods. These 
antibodies may be generated according to methods known in the art, using prediction from 
hydrophobicity charts, as described in the "Anti-MOLX Antibodies" section below. For 
example the disclosed MOL8 protein has multiple hydrophilic regions, each of which can be 

20 used as an immunogerx In one embodiment, a contemplated MOL8 epitope is from about amino 
acids 1 to 10. In another embodiment, a MOL8 epitope is from about amino acids 50 to 200. In 
further embodiment, a MOL8 epitope contains amino acids 210-400, 475-600, or 625-850. These 
novel proteins can also be used to develop assay system for functional analysis. 



25 MOL9 

MOL9a 

A novel nucleic acid encoding a neurorysin -like protein was identified by TblastN using 
CuraGen Corporation's sequence file for MOL9 probe or homolog, run against the Genomic 
Daily Files made available by GcnBank. The nucleic acid was further predicted by the program 
30 GenScan™, including selection of exons. These were further modified by means of similarities 
using BLAST searches. The sequences were then manually corrected for apparent 
inconsistencies, thereby obtaining the sequences encoding the full-length protein. The disclosed 
novel MOL9a nucleic acid of 2355 nucleotides (also referred to as 1950671 9_B_EXT) is shown 
in Table 9 A. An open reading frame begins with an ATG initiation codon at nucleotides 1-3 and 
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ends with a TGA codon at nucleotides 1915-1917, A putative untranslated region upstream from 
the initiation codon and downstream from the termination codon are underlined in Table 9 A, and 
the start and stop codons are in bold letters. 

The nucleic acid sequence has 1307 of 1428 bases (91%) identical to a pig neurolysin 
5 mRNA (GENBANK-ID: AB0001 70) (Expect = 0.0). 



Table 9A. MOL?a Nucleotide Sequence (SEQ ID NO:21) 

ATGTTGACTTTGGAC: CAACAGAAATCCCTAATT CT TAT TOT TTT CTGAT TCTTT TT AG AGTTGGTGGTTC 
CAGGATTTTAC T CAGAAT GA.CGT t AGGAAG AG AAGT^TGTCTCCTCTT CAGGCAATG TCT TCCTAT ACT 3 
?GGCT&3CAGAAATGTTTTAAGAI©33A^ 

GT G CAGACCAAACAGGTGTACGAT GC TG TTGGA7 lTGCT CGGTRT? GAGGAA3T AAC TT AC G AC AAC ICTCT 
GCAGGCACTGGC AGT G GAAAGGACCAT GCT AGACTT? CCCCAGCAT GTAT 3 CTT CTGACAAAGAA3T ACGAC 
CAGCAA^ACAGAAGCAGACAAAAGACTTTCT 

GAG AGAAT TC TT C AliT' T AC ASCAGGAAACCT GTG AT CTGGXi; ^AAGAT AAAACCl^AGGCCAGACC^' ACU' 'f 
GGAAAA3TCAATTA AAAT GGGGAAAA3A AATGGGCTCCATO TCCTG A ACA AGTAC AG AATGfl A AT 7.AAAT 
C AATGAAGAAAAGAAT GAGT-GAGCTATGT ATTGArir'T AACAAAAAC CT CAAT GAG GAT GAT AC CTTCCT ! 
GT ATT TTCCAAGGCT GAACTTGGT GC TC T TCCT GATGATTTC ATTG ACAGTTC AGAAAAGAC AGAT GATGA 
CAAG^ ATAAAATTACC T TAAAATATC C ACACT AT TTCCCTGT CAT SAAGAAAT GT T GT AT CC CT GAAACC A 
GAAGAASGjVTGGAAATGGCTTTTAATACAASGT^^ 
CCACTGSC-AACCAAGGTGGCCAAACTACTCGGT^^ 

TG CA A^GAGC AC AAGC CGC GT AACAGC CTT T C T AGA T GATTT AAG CCAGfc ASTTA AAACCCT T GGG TGAAG 
CAGAACGAGAGTTTATT TT GAJVITTGAAGAAA-AAGGAATGC AAAGACAGG SGTTTT GAAT ATGATGSGAAA 
ATCAATIKCTGCJGATCTATA^ 
CCTCAAGGAATACTTCC£AATTGAGGTGGTCA^ 

TT T CAT TT GAAC AAAT GAC AGATGCT CATGT T TG GAACAAG AGTGTT ACAC TT TAT ACT GTGAAGGAT AAA 
GC T ACAGGAGAAGTATT GGGACAGTT CT AT T T GG ACCTCTAT CCAAGGCC T AGGGAAGGAAAAT AC AATCA 
TGCGGCCTXSCTTCGGTCTCCAGCCTGGCTGCCTTCTGCC^ 

TCGT GGIGAACT TCTCACAGCCAGT GGCAGGTCGT CCCTCT CTCCT GAGACACGACGAGGrGAGGACT TAG 
TT TCAT GAG TT T GGT CACGT GAT G CAT CAGAT TT GT GCACAG ACT GAT TT I GCAC GAT T T AGCG 3AACAAA 
TGTGGAAAC TGACTTT GT AGAGGT GCCATCGCAAAIGCTTGAAAATTG G^T GT^GGACGTCGATTCCCTCC 
GAAGATTGTCAAAACATTATAAAGAT GGAAGCCC P ATTGCAGACGATCTG ^ TTGAAAAACTTGTTG CTTCT 
AG GC TGGT CAACACAG GT ATGGGTTAT GTTAT TAGT AATAT ATAT T T TCT AGATAT GT T T TCATTT CAGTG 
CAT C CAAGTT G GAATGAAAT ACAG AAACCT AATC CT GAAAC CTGGGGGATCTCTGGACGC-C ATGG ACATGC 
TC CACAATTTC T TGAAA(^TGAGCC^AACCAAAAAGCgITC CTAAT GAGT AGAGG CCTGCATGCTCCGTGA 
AC TGGG5AT CT T T GG T AGCCC-TC C AT GTCTGGAGGACAAGT CGACATC AC CATGT GTT ACT GGCCT GGAAA 
CT GAAGGGAGT T T TGCAAGT GAAAATTTAGAT TT CTATTG ACAT CCTT TT STTTT CT AATT TTAAAAATT A 
T AAAG AT GT AAAT GGAATl^lAAATACTT^ GT ACT ATAAA 
AT TT CATAAAACT GGATT TGATT T CTTTT TAT GAAAGTTTCAT AT GAA TG TAACT T GATTT TTT ACTAT TA 
TAAT C T AGATAAT ATGAT ATAAGAGGGCT ARGiUlTTTTT AA AT TGAArCA TATA TATGATATA AI M TGATC 
CTT C TT GT ATC TTGAAGTT T TGTACT T GGG AT TT C T GG ACT GATAAATGAA.TCATCAC AT TCT T CT GGT AA 
ATATTTICTTGG " 



The MOL9a protein encoded by SEQ ID NO:21 and has 638 amino acid residues, and is 
presented using the one-letter code in Table 9B (SEQ ID NO:22), The SignalP, Psort and or 
10 Hydropathy profile indicate that this sequence has a signal peptide between positions 23 and 24 
and is likely to be localized at endoplasmic reticulum and plasma membrane (Certainty = 
0.8200). 



Table 9B. Encoded MOL9a protein sequence (SEQ ID NO:22) 

MLTL DQQKSLI L ILFL I L tRVGGS R I L1RM7LGREVMSPLQ AMS S YT VAGRNVLRtt DI»S PE Q I K73TEEL I 
VQT KQV YDAVGMLG I EEVT YE^CLGALAVER'JML DF P QHVS S DKE VRAA5 TEADKRLS RE D I EM3HRGD I F 
ERIV3LQQETCDLGKI KPEARRYLEK9IKM3KRHGLHLPEGVQNEIKSMKKRMSELCI DFNKKl^lEODTFL 
Vp S KAELGALP DD FI D S LE KTDD DKYKI TL KYPHYFP VMKKC C I P E T RRRMEMAFN TRC KEEK T 1 1 LGQLL 
PLi\TX VAKLLCY STHAD P VLEMNT AKS T S RVT Ai'LtfDLS QKL KPLGfi AE RE ET LXL KKKU CKDRG FE YDG K 
IKAWDLYYYMTQTEELKYS I DQE3XKR YFPZEVVTZGLIiNTyQELLGljS FEQtffiDAKVWHKS VTLYT VKDK 
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FI IB FGH VMHQI CAQT D FARFS GTNVET DFVEVP3 QMLENW VNDV D SLRRL SKH YKDSS PI AD DI J iEKLVAS 
Rl-VNTGMGrv/ISNIYFLOOTSTOIQ^ 

The full amino acid sequence of the protein of the invention was found to have 483/564 
(85%) identity and 508/564 (90%) similarity to a rabbit endopeptidase (PatP Accession No. 
R261 14), and have 571 of 632 ammo acid residues (90%) identical to, and 592 of 632 residues 
5 (93.6%) similar to, the 704 ammo acid residue neurolysin protein fiom pig (ptnnSPTREMBI^ 
. - ACC: Q02038) (E value - 1 . le- 302 ). 



MOL9a also has homology to Ihe proteins shown in the BLAST alignments in Table 9C. 



Table 9C. BLASTX results for MOL9a 






Smal. 


Lest 








Sum 








High 


pcob 






Sequences producing Hlgn-scotlng Segment ?airs: 


Score 


P(N) 




n ; 


ptnr:S>7ISSN3Vr-ACC:Q0203e NEUROLY9IN PRECURSOR (EC 3.4.24.. 


. 2665 


1 .le- 


302 


2 


ptnr:SXIS8PRQT-ACC:P13675 WBUROIiYSIN PRECURSOR (EC 3-4-2C 


. 2697 


2.2e- 


302 


2 


ptnr:SPTREK3L-ACC:P79433 BNDOPEPTIDASB 24.16 (EC 3.4.-.-). 


. 2677 


7.6e- 


302 


2 


ptnr:SWISSPROT-ACC:P42676 NRUROLYSIN PRECURSOR (EC 3.4.24. 


. 2601 


4.7e- 


2 91 


2 


ptnr ; SWrSSPROT-ACC:P47788 THIBET GLIGOPEPTinASE (EC 3.4,2. 


. 1792 


fl.4e- 


197 


2 



Chromosomal information 

10 MOL9a maps to the Unigene entry Hs. 2215 1 which maps to chromosome 5 between 

markers D5S427-D5S647 (69.6-74.7 cM). 

Tissue expression 

MOL9a is expressed in at least the following tissues: fetal lung, testis, B-cell, aorta, 
15 brain, colon, foreskin, geim cell, heart, kidney, pancreas, stomach, uterus, whole embryo and 
cancer cell lines MDA-MB-231 and MCF-7. 

These materials are farther useful in the generation of antibodies that bind immuno- 
specifically to the novel MOL9a substances for use in therapeutic or diagnostic methods. These 

20 antibodies may be generated according to methods known in the art, using prediction from 
hydrophobicity charts, as described in the "Anti-MOLX Antibodies" section below. For 
example the disclosed MOL9a protein has multiple hydrophilic regions, each of which can be 
used as an immunogen. In one embodiment, a contemplated MOL9a epitope is from about 
amino acids 50 to 75. En another embodiment, a MOL9a epitope is from about amino acids 100 

25 to 150. In further embodiments, MOL9a epitopes are found in amino acids 175-200, 225-300, 
325-375, 425-450, 500-550, and 600-625. These novel proteins can also be used to develop 
assay system for functional analysis. 
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MOL9b 

The cloned open reading frame, codes for a 687 amino acid long protein with an overall 
95% amino acid identity, to the mature form of the pig neurolysin precursor (SWISSPROT- 
5 ACC:Q02038)>Oligonucleolide primers were designed to PCR amplify a DNA segment, 
representing an OR*, coding for the mature form of 1 95067 19JB fiXT.The forward primer 
includes an. in frame, BamHI restriction site. The reverse primer contains an, in frame, Xhol 
restriction site. The sequences of the PCR primers are the following: 

10 195067 I9JB-EXT Mat-Forw: 

GGATCCTCGAGGATTTTACTCAGAATGACGTTAGG (SEQ IDNO:58} 
195067 1 9_B-EXT FL-Rev: 

CTCGAGCGGAGCATGCAGGCCTCTACTCATTAGGAACG (SEQ IDNO:59) 



1 5 PCR reactions were set up using a total of 5ng cDN A, consisting equal amounts of 

cDNA derived from human fetal brain, testis, skeletal muscle and mammary, template, 1 microM 
of each of the 19506719 B-EXT Mat-Forw and 195G6719JB-EXT FL-Rcv primers, 5 
micromoles dNTP (Clontech laboratories, Palo Alto CA) and 1 microliter of 50xAdvantage-HF 
2 polymerase (Clontech Laboratories, Palo Alto CA) in 50 microliter voJurne. The following 

20 reaction conditions were used: 
a) 96°C 3 minutes 

b) 96°C 30 seconds denaturation 

c) 60°C 30 seconds annealing 

d) 72°C 3 minute extension. 

25 Repeat steps b-d 35 times 

e) 72oC 10 minutes seconds final extension 



A single, 2. 1 kb large, PCR product was isolated from agarose gel and ligated to pCR2.1 
vector (Invitrogen, Carlsbad, CA). He cloned insert was sequenced, using vector specific, Ml 3 
30 Forward(-40) and M13 Reverse primers as well as the gene specific primers: 

GGACCATGCTAGACTTTCC, 1 95067 19JB-EXT SI; {SEQ ID NO:60) 
GGAAAGTCTAGCATGGTCC, 19506719_B-EXT S2; (SEQ ID NO:61) 
GGCTGAACTTGGTGCTCTTCC, 1 950671 9_B-EXT S3; (SEQ ID NO: 62) 
GGAAGAGCACCAAGTTCAGCC, 1950G719JB-EXT S4; (SEQ ID NO:63) 
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GGCTTGCTGAACACCTACC, 19506719_B-EXT S7; (SEQIDNO:64) 
GGTAGGTGTTCAGCAAGCC, 1950671 9J3-EXT S8; (SEQ ID NO:65) 
GCACAGACTGATTTTGCACG, 1 95067 19_B-EXI S9; (SEQ ID NCX66) and 
CGTGCA AAATCAGTCTGTGC, 1 950671 9_R-EXT S10 (SKQ TT)NO:67). 



The disclosed novel MOL9b nucleic acid of 2061 nucleotides {also referred to as 
MOL9b) is shown in Table 9D. It is thought that MOL9b is an internal fragment of an open 
reading frame. Therefore its 5* and 3' termini may be extended. 



Table 9D. MOL9b Nucleotide Sequence (SEQ ID NO:23) 

~TCC AG TTTACTCAGAAT G ACGT T AGGAAG AGAAGTGAT GCCT CCTCTTCAGGCAAT G T CTT CC T ATAC 
1GTGGCTGGCAGAAAT GTTT TAAGATGGGA1 CT T 'I'C ACCAGAGCAAAT T AAAACAASAAC TGAGGAGCT CA 
% TGTG CAGAC CAAACAGGTGT ACGAT SCTGT1G GAATGCTCGGTATTG AGGAAGTAAC TTACGAGAACT GT 
CTGCAGGC^CTGGCAGAT^AGAAGTAAAGTATATAGTGGAAAG^ 

ATC CT CTGAC AAAGAAGTACGAGCAG CAAGT ACAGAAGCAGACAAAAGACTT T CTCGTTTT GAT ATTGAGA 
1 G AGCATGAGAGG AGATAT A? T Tli^GAGAATTGTTC ATTT ACAGG AAACCTGT GAT CT GGGGAAGATAAAA 

CCT G AGGCCAGAC gat actt g gaaaagttcaat t aaaatcgggaaaagaaatgggct ccat c t tcc t gaaca 

AGT ACAGAATGAAATCAAATCAAT G AAGAAAAGAAT GAGT GAGCT ATGTATT GATTTTAAC AAAAACCT CA 
ATG AGGAT GATAC CTTCCTTGT AT TT TCCAAG G CTC AACT T GCTTGCTC T*_' CCT GA'l 1 GJYJL1 1 T CAT T GACAGT 
1 T AGAAAAGACA3 ATG AT GACAAGT ATAAAAT T ACCTT AA AATATCCACACT AT TTCCCT GT CAT GAAG AA 
ATGTTGTATCrcTGAAACCAGAAGAAGGATCG^^ 

TAATTTTGC AGCAG CT AC TCCCACrGC^JVZC/'JiGGT GGCCAZ\ACTACTC GGT T ATAGCAC ACAI GCTG AC 
TTCGTCCT T GAAAIGAACACTGCAAAGAGCAC AAGCCGCGT AACAGCC TT TCT AGATGAT TT AAGCCAGAA 
GTT AAAACC GTTG GGTG AAGCAGAACGAGAGTT T AT T TTGAATTT GAAG AAAAAJ3GAATGC AAAGACAG SG 
GT TT TGAAT ATGA1 GG GAAAATCAA T GCCTGG GATCT AT ATT ACT AC/i iT Gv\Cr C *\GZiCAGAGGAACT CAAG 
TATTC CAT AG ACC AAGAGTT CCTC AAGGAATACT T CCCAATTGAGGTGG T CAC T GAAGGCT TGCT GAACAC 
CT ACC A-GGAGTTG1 TGGGAC T TTC AT TTG AACAAAT GACAG ATGCTCAT G TTT GG AACAAGAGTGT T AC AC 
'J 'IT Al' ACTGT GAAGGAT AAAGCTACAGGAGAAGT ATTGGGACAGT T CT AT TTGGACCTC? ATC CAAGGG AA 
GGAAAATAC AA7C ATGCG GCCTGCTTCGGT CT CC AGCCTGGCTGCCTTCTG CCT GATGGAAGCCGGAXG AT 
GGCAGTGGC TGCCCTCGTGGTGAACTT CT CAC AGCCAGTGGCAGGTCG'TCC CTCTCTC CTGAGACAC3 ACG 
AGGTGAG GACTT ACT TT C ATGAGTTTGGT C AC GT GATGCATCAGATTTGT G CACAGAC TGATT TT GC ACGA 
TT T AGCGGAACAAAT GT3 GAAACTG ACTT TGT AG AGGTGCCATCGC A AATGCT T GAAAAT TGGGT GT 3GGA 
CGTC GAT TCCCrC CGAA3 ATT GT CAAAAC ATT AT AAAG AT GGAAGCCC7 AT TG CAGAC GATCT GCT T SAAA 
AACTTGTTGCTTC T AGG C T GGTCAACACAGGTCTTCT GACCCT GCGCCAG ATT STTT TGAGCAAAGT TGAT 
CAGTCTCT T CAT AC C AAC AC AT C GCT G GATGC T GCAAGTGAATA.T GCCAAAT ACi^CT CAGAA AT ATTAGG 
AGT'TGCAGC TACT CCAGG CACAAATAT GCCAGCT ACC TTTGGACAT TT GG C AGGGGGATAC GAT GGCCAAT 
ATT ATGGAT ATCT TT GG AGTGAAGTATT T T CXAI GGAIATGTTTT ACAGC T GTT AAAAAALiAAGGGATA 
ATGAATCCGGAjGG TT GG AATGAAATAC AGAAACC7AATCCT GA A ACCT GGGGG A TOT! TGGACGGCATGGA 
CAT GCTCCACAAT T TCTT CAATXGTtSAGCCTxAACCAAAAASCGT T CCT AAT GAGTAGAGGCCT GCLATGCTC 
CG . 



The MOL9b protein encoded by SEQ ID NO:21 has 687 amino acid residues, and is 
presented using the one-letter code in Table 9E (SEQ ID NO:24). 



Table 9E. Encoded MOL9b protein sequence (SEQ ID NO:24) 

SRILLRMTL3REVM3 P1QAMS3 YWAGRNVLRWD1S PECIKTRTEELIVQT KQVyHAVQ^G IiJ KVTYtiNC 
LQAJJVDVSVK Y IVE RTML DFPQHV5S DKEVRAAS TFJOXRLSRFD I EMS MRGD I FERI VffLQET CDLGKI K 
PE ARRYLEKS I KMGKRH SLHLPE QVQN EI KSMKK RMS ELCI D FNKNLNE DOT FI.VFS KAEX GAL? D 3 FI DS 
L3KT D DDKYK I TL K Y PH YF P VMKKCCI PETRRRMEMAJ?NT RCKE E NT 1 1 L CQLLPLRT KVAKLLG Y STHAU 
F VLEMKT AKS TSRVT AFTi DDT *S QFCT «Kr LG 1AE RE 71 LNLKKKE CKDRGFE YDGKHJ ASfDLYYYMT QTEELK 
Y8I DQE TLKEY FP I EWT E GliLNTYQE LLGL S FEQMT DAHVTCN KS VTX YTVKDKATGE VLGQ FYL i ) I'.YVftffi 
GXYNHAACF3LQPGCLLP DGSIOSXIAVAALV/NFS QPVAGRPSLLRH DE V RT YF t-lE FGKVWtiQ I CAQT DFAR 
FS GTMVETDF VEVP S QML iNWVWDVD £ LRRLS KH YKBG3 PI AD DT J.EKLVASRJ •VHTGI/LTUlC I VLSKl'E 
QS L HT NT S LDAAS E YAKY C S E I LGVAAT PGT1SM PAT FGHLAGG YDGQYYGYUJSEn/ FS MDMFYS C FXKE GI 
MM PEVGKIKYRN L I LK PGG 5 LDSMDML'^NFLKRIl PHQXAFLM5 RGLHAP 
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MOL9c 

Fn the present invention, the target sequence identified previously, Accession Number 
1950671 9JB_EXT, was subjected to the exon linking process to confirm the sequence. PCR 
5 primers were designed by starting at the most upstream sequence available, for fllie forward 

primer, and at the most downstream sequence available for the reverse primer. In each case, the 
sequence was examined, walking inward from the respective termini toward the coding 
sequence, until a suitable sequence that is either unique or highly selective was encountered, or, 
in the case of the reverse primer, until the stop codon was reached Such primers were designed 

10 based on in silico predictions for the full length cDNA, part (one or more exons) of the DMA or 
protein sequence of the target sequence, or by translated homology of the predicted exons to 
closely related human sequences sequences from other species. These primers were then 
employed m PCR amplification based on lie following pool of human cDNAs: adrenal gland, 
bone marrow,- brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 

15 nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, tetal lung, heart, 

kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, tbyroid, trachea, 
uterus. Usually the resulting ampKcons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences ftom all clones were assembled with themselves, with 

20 other fragments in CuraGen Corporation's database and with public HS'i s. Fragments and ESTs 
were included as components for an assembly when the extent of their identity wife another 
component of the assembly was at least 95% over 50 bp. In addition, sequence traces were 
evaluated manually and edited for corrections if appropriate. These procedures provide the 
sequence reported below, which is designated Accession Number CG56222-01 

25 Hie disclosed novel MOL9c nucleic acid of 2167 nucleotides (also referred to as 

CC56222-0 1 ) is show in Table 9F An open reading frame begins with an ATG initiation codon 
at nucleotides 16-18 and ends with a TGA codon at nucleotides 2 1 28-2 13 0. A putative 
untranslated region upstream from the initiation codon and downstream from the termination 
codon are underlined in Table 9F, and the start and stop codons are in bold letters. 

30 In a search of sequence databases, it was found, for example, that the nucleic acid 

sequence has 2000 of 2167 bases (92%) identical to a gb:GENBANK- 
ID:AB000170|acc:AB000170.1 mRNA from Sus scrofe (Porcine mKNA for endopeptidase 
24.16, complete cds) (Expect = 0.0). 
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Table 9F. MOL9c Nucleotide Sequence (SEQ TD NO:25) 

CC TCTCAGC3CTCCCA TGAT C G CCCGGTG CCTTTTGGCFGT GCCAAG CCTCCGCAGAGT T GGTGGTT CC AG 
GATTTTACT CAGAATGACGT TAG GAAGAG A AGTGAT GTCTCCTCTT CAGGCAAT GTCTT CC TAT ACT GT G 3 
C TG CK^GAAATGT TTT AAGATGG GAT CTT TC ACCAGAGC AAATT AAAACAAGAACT SAGGAGCT CATTG TG 
C AGAC C AAACAOG TGT ACG AT GCT GT TGGAAT GCTCGG? AT TGAGGAAGT AACTTACGAG AACT GT CT3 C A 
GGCAC TGGCAGATGT AGAAGT AA AGT AT A1AGTGGA A AGG ACCATGCT AGACTT TCCCCAGCAT GT AT C CT 
CTGAJCPAAGAAGTACGAGCMCAASTACAGAAGCAGACAAAA^ 

ATGAGAGGAGATATAT TT GAGAGAAT TGT TCATTT ACAGGAAACC TGT GATCT GGG SAAGAT AAAAGCT G A 
GGCCASACGAT^TTGGAAAAGT CAATT AAAATGGGG AAAAG AAATGGGC T CC ATCT T CCTG AAC A AGT AC 
AGAAT GAAATCAAATC AAT GAAGAAAAGAAT GAGTGAGCT ATGTATTGAT TTTAAC AAAAACCTC AATSAG 
GAT GAT AG CTTCCT T GT AT TT T CCAAGGCT GAAC T TGGT GCTCTT CCT G ATGATTT CATT GACAG T TT AGA 
AAAJ3AC AGATGAT GACAAGTATAAAATTA-CCT T AAAATATCCACACT ATT TCC CTGTCATGAAGAAAT GTT 
GT A?C CCT GAAAC CAGAAGAAGGATGGAAATG GCTTT T AAT ACAAGGT G CAAAGAGGAAAACACCAT AA1 T 
TT GCAGCAGCTAC T CCCAC^GCGAACCAJlGGTGGCCAAACrACTCGGT T ATA3 CACACATGCTGACTT CGT 
CCTTGAAJVTC^^CTGCAAAGAGCAC^ 

AACCC I' TG SGTGAAG C ASAACGftGAGTT TAT TT T GAAT TTG AAGfiAAAAG GA^TGCAAAGACAGGGGT r TT 
GAA7AT GATGGGAAAATCA AT GCCTGGGAT CT AT ATT ACTACATGACTC AGACAjGAGGAACTCAAGTAT TC 
CAT AGACC AA(^GTT CCTCAAGGAAT ACT TCC C AATT GAG 3 tGGTCACT GAAGGCTTOCT GAAC ACCF ACC 
AG GAG T T GTTGGGACTT T CATTTtGAAC AJIATGACAG AT GCT C ATGTTT G GAAC AACAGTGT T ACAC TT* TAT 
AC TGT GAAJGGAT AAAGCT ACAG G AG AAGTAIT GGGAC AGT T CTA'J TTGG ACC T CT ATCC AAGGGA AGGAAA 
AT ACAATCAT GCGGC CTGCTTC G GTCT CC AGCCT GGC I GC CT TCTGCC T GATGG AAGCCGG ATGAT GGCAG 
T GGCTGCCXTTCGTG GTC7ACTTC TCAt *AGCC AG^GGC A(XtT CGTCC CTCT CT'CCl'CJAG ACACGACG AGG 1"G 
AGGACTTACT T TCAT GAG T TTGG TCACGT GAT GC ATCAGATTTGTGCACAGACT GATT T?GCACG ATTT AG 
CGGAACAAATGTGGAAACT GACTTTGT AGAG GTGC CATCGCAAATGCT TG AAAATCG GGTGTGGG ACGT CG 
AT TCCCTCOGAAGATT GT C AAAAC AT/I AT AAAGACG GPJ\G ClCC'lL^ TTC GAG ACGATCTGCT T GAA A AACTT 
GT"GCTTCTAGGCT GG*?CAACACAGGT CT TCTGACCCTG'23CCA3ATT GTTT T GAGC AAAGTTGATCAGTC 
T CT TC AT AOCAAC ACATC GCTGGAT GCTGC AAGC GAAT AT GCCAAATACT G CT CAG AAA7AT TAGGAGT TG 
CAGCT AC TGCAGGC ACAAATAI GCXIAGCT ACCT T TGG ACATTTG^K^GGG GGA r ACGATGGCCAA1' ATT AT 
GGATAT Cm C-GAGT GAAGTAT T TTCCAT GGAT ATGTT TT ACAGCTGT 7TTAA AAAAGAAGGGAT AATGAA 
TC CGGAGGTT GGAAT GAAAT ACRGAAACCT AAT C CTG AAACCT GGGGGAT CTCT GGAC GGCAT GGACAT GC 
TC CAC AAT^ 'CT^GAAACG TGAGCXAAACCAAAAAGC GT TCCTAAT GAGTAGAGGCCT GCAC GCTC CGTGA 
ACT GGG GATCTTTG GTAGCCGTCCATGTCTG GAGGAC , 



The MOL9c protein encoded by SEQ ID NO:25 has 703 amino acid residues, and is 
presented using the one-letter code in Table 9G (SEQ ID NO: 26). The SignalP, Psort and/or 
Hydropathy profile for MOLDc predict that MOL9c has a signal peptide and is likely to be 
5 localized at the cytoplasm with a certainty of 0.9200. The SignalP predicts a cleavage site at the 
sequence between amino acids 1 7 and 18. 



Table 9G. Encoded MOL9c protein sequence (SEQ TD NO:26) 

MI ARCLLAVRSLRRVGGSRI LLRMTLGREVK5PLQAM3 S YTVAGRN'/LRWDLSPEQI KTRT EB1»IVQTKQV 
YDAVCMLG IfciK VTYEN CLQALAD VE V KY 1 ViSKTML DF PQH VS S DK3 VRAASTE AEKRL SRF D I EMSMRGDI 
FF.RrVHLQETCDIiGKIK?EARRYLEK5, II<H6KRNGLtIL?EQV0NE IKSMK EFNKNL1IE 3DTFL 
VF S KAELGAL PC DTI DS LE KTDDDKYK I? L KYPH Y FP V <KKC C I P3TRRRM3MAENT R.CKE ENT 1 1 LQQLL 
PLRT KVAKLLGYS T HADFVLEMNTAKST S RVT AFL DDL 3 (2KLK PLGEAil H3 FI LNLKK KE CKDRG T3YDGK 
rNAHDLYYYMTQTEBLKYS I DQBF.LKE VFPI E WTEGIiLNTY QFJJjGL S 7ZQMT^AHVWNKSVTLYTVKDK 
ATGEVLGQ FYLDLYP REGKYNKAAC FGLQ P G C LL P DG 3 RMMAVAAL VVN J 5 QPVAGRP SLLRH DEVRTY FH 
E FGHVMHQTC AOTDFAR FS Gl'N VETO FVli VPS OMLENWVWDV D SLRRL S KH YKDG5 P I ADDLLEKL VAS RL 
VNTGT J .TdRQIVLS KVDQS LHTNT SLDAAS EYAKYCS EI I/Ty/AATPGTUMP AT FGHLAGGYDGQY Y GYLWS 
E V F SMCIMFY5 Z FKKEG IMN F Sl^Gt^IKY r.NL I LKPG G SXj DGMDMLHN FLKRE PNQKAFLM SRGLHA ? 



The full amino acid sequence of MOL9c was found to have 657 of 704 amino acid 

10 residues (93%) identical to, and 687 of 704 amino acid residues (97%) similar to, the 704 ammo 

acid residue ptnnS WISSPROT-ACC:P42675 protein from Oryctolagus cuniculus (Rabbit) 

(NEUROLYSIN PRECURSOR (EC 3.4.24.16) (NEUROTENSIN ENDOPEPTIDASE) 

68 



WO 01/S1578 



PCmSfH/13578 



10 



(MITOCHONDRIAL OLIGOPEPTIDASE M) (MICROSOMAL ENDOPEPTIDASE) (MEP) 
(E value = 0.0) 

MOL9c is expressed m at least the following tissues: Artery, Brain, Bronchus, Cartilage, 
Cervix, Colon, Coronary Artery. Dermis, Epidermis. Foreskin, Heart, Kidney, Liver, Ovary, 
Pancreas, Pituitary Glond, Placenta, Prostate, Salivary Glands, SynovmmASynovial membrane, 
Thalamus, Umbilical Vein, Uterus. This information was derived by determining the tissue 
sources of the sequences that were included in the invention including but not limited to 
SeqCaUing sources, Public EST sources, literature sources, and/or RACE sources. 

Possible SNPs found for MOL9c are listed in Table 9H. 



Table 9H: SNPs 


Consensus 
Position 


Depth 


Base 
Change 


399 


99 


A> G 


858 


51 


OA 


863 


50 


T> A 


1242 


48 


T>C ' 


1810 


141 G > A 

1 1 


1824 


143 


T>C 


1892 


144 


T>C 



15 



20 



25 



These materials are further useful in the generation of antibodies that bind hnmuno- 
specifically to the novel MOL9c substances for use in therapeutic or diagnostic methods. These 
antibodies may be generated according to methods known in the art, using prediction from 
hydrophobicity charts, as described in the? "Anti-MOLX Antibodies** section below. For 
example the disclosed MOL9c protein has multiple hydruphflic regions, each of which can be 
used as an hnmunogen. In one embodiment, a contemplated MOL9c epitope is from about 
amino acids 25 to 75. In another embodiment, a MOL9c epitope is from about amino acids 100 
to 200. In further embodiments, MOL9a epitopes aTe found in amino acids 250-400, 450-550, 
and 650-700. These novel proteins can also be used to develop assay system tor functional 



Homology between the MOL9 isoforms and other homologous proteins is presented 
graphically in the multiple sequence alignment given in Table 91 (with MOL9a being shown on 
line 1 , MOL9b on line 2, and MOL9c on line 3) as a ClustalW analysis comparing MOL9 with 
related protein sequences. 

69 



WO lll/8157» 



pcms(n/u57N 



Table 91. Information for the ClustalW proteins: 

1) MOL9a <SEQ ZD NO: 22) 

2) MOL9b (SEQ ID NO: 24} 

3) MCL9r. (SRQ TD WO:2fi! 

4) SNISSNEW-ACC:Q02O38 NZUROLYSIN PRECURSOR (SEQ ID KO:6E) 
a) Sf?ISS?BOT-ACC:742B7!» KEUKOLYSIN PRECURSOR (SEQ ID NO: 69] 

5) .SPTRRMBL-RCC: P79433 P.KDOP^PTTPAS^ 24.) 6 i.SRQ ID NO:7fl) 

6) SWISSPROT-ACC:?42676 KEUROL^SDJ PRECURSOR (SEQ ID KO:71- 

7) ACC:P477bS TUIKET OHGCPEPTIDASE (SEQ ID NO:/*) 



r 



40 50 
— HLTLD QQKSIILILF LI 



MDL9a Prot 
MOLdb Prot 
M0L9c Prot 
Q02039 
542675 
P79433 
P42676 
B477B8 



10. 
.1 . 



20 



30 
.1 . 



-HISRCUcAV 3Pte| 
-MIVRCLBA& 
-MIARCFSAV 



MVVPBGHLAR ELGATFS3SA P1GGKPFPEV WDCLSCKQGD WSQAREKFNa ERREi 
MIlXCLSTi 
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gi 1 7305353 1 HHQQ^K^rraKKPL 

gi 1 3927 3 06 | ^KG-lflRWiCQRAAWPGHAQNC 



Endopeptidasc 24.16 or mitochondrial oligopeptidase, abbreviated here as EP 24.16 
(MOP), is a thiol- and metal-dependent oligopeptidase that is found in multiple intracellular 
compartments in mammalian cells. From an analysis of the corresponding gene, we found that 
5 the distribution of the enzyme to appropriate subcellular locations is achieved by the use of 
alternative sites for the initiation of transcription. The pig EP 24.16 (MOP) gene spans over 100 
ldlobases and is organized into 16 exons. The core protein sequence is encoded by exons 5-16 
which match perfectly with exons 2-13 of the gene for endopeptidasc 24.1 5, another member of 
the thimet oligopeptidase family. These two sets of 1 1 exons share the same splice sites, 

LO suggesting a common ancestor. Multiple species of mRNA for EP 24. 1 6 (MOP) were detected 
by the 5-rapid amplification of cDNA ends and they were shown to have been generated from a 
single gene by alternative choices of sites for the initiation of transcription and splicing. Two 
types of transcript were prepared, corresponding to transcription from distal and proximal sites. 
Their expression in vitro in COS-1 cells indicated that they encoded two isoforms (long and 

15 short) which differed only at their amino feiTnini: the long form contained a cleavable 

mitochondrial targeting sequence and was directed to mitochondria; the short form, lacking such 
a signal sequence, remained in the cytosoL The complex structure of the EP 24.16 (MOP) gene 
thus allows, by alternative promoter usage, a fine transcriptional regulation of coordinate 
expression, in the different subcellular compartments, of the two isoforms arising from a single 

20 gene. PMID: 91 82559, UI: 97326108 We have isolated a metallopeplidase from rat liver. The 
peptidase is primarily located in the mitochondrial intermembrane space, where it interacts non- 
covalently with the inner membrane. The eirzyme hydrolyzes oligopeptides, the largest substrate 
molecule found being dynorphin A 1-1 7; it has no action on proteins, and does not interact with 
alpha 2-maetoglobulin, and can therefore be classified as an oligopeptidase. We term the enzyme 

25 oligopeptidase M. Oligopeptidase M acts similarly to thimet oligopeptidase (EC 3.4.24. 1 5) on 
bradykinin and several other peptides, but hydrolyzes neurotensin exclusively at the -Pro+Tyr- 
bond (the symbol + is used to indicate a scissile peptide bond) rather than the -Arg+Arg- bond. 
The enzyme is inhibited by chelating agents and some thiol-blocking compounds, but differs 
from thimet oligopeptidase m not being activated by thiol compounds. Hie peptidase is inhibited 

30 by Pro-lie, unlike thimel oligopeptidase, and (he two enzymes are separable in chromatography 

on hydroxyapatite. The N-teiminal amino acid sequence of rat mitochondrial oligopeptidase M 

contains 1 9 out of 20 residues identical with a segment of rabbit microsomal endopeptidase and 

1 7 matching the corresponding segment of pig-soluble angiotensin II-binding protein. Moreover. 
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the rat protein is recognized by a monoclonal antibody against rabbit soluble angiotensin II- 
binding protein, all of which is consistent with these proteins being species variants of a single 
protein that is a homologue of thimet oligopeptidase. The biochemical properties of the 
mitochondrial oligopeptidase leave us in no doubt that it is neiimrysin (RC 3.4.24.1 6), for which 
5 no sequence has previously been reported, and which has not been thought to be mitochondrial. 
PMID: 7836437, Ul: 95138171 We have isolated by immunological screening of a lambda 
ZAHI cDNA library constructed from rat brain mRNAs a cDNA clone encoding endopeptidase 
3.4.24.16. The longest open reading frame encodes a 704-amino acid protein with a theoretical 
molecular mass of 80,202 daltons and bears the consensus sequence of the zinc metalloprotease 

10 family. The sequence exhibits a 60.2% homology with those of another zinc metaQopeptidase, 
endopeptidase 3.4,24.1 5. Northern blot analysis reveals two mKNA species of about 3 and 5 
kilobases in tut brain> ileum, kidney, and testis. We have transiently transfected COS-7 cells with 
pcDNA3 containing the cloned cDNA and established the ovcrcxprcssiou of a 70-75-kDa 
hmnonoreacdve protein. This protein hydroryzes QFS, a quenched fluorimetric substrate of 

1 5 endopeptidase 3.4 .24.16, and cleaves neurotensin at a single peptide bond, leading to the 

formation of neurotensin (1-10) and neurotensin (1 1.-13). QFS and neurotensin hydrolysis are 
potently inhibited by the selective endopeptidase 3.4.24.16 dipeptide blocker Pro-Ilc and by 
dithiothreitol, while the enzymatic activity remains unaffected by phosphoramidon and captopril, 
the specific inhibitors of endopeptidase 3.4.24. 1 1 and angiotensin-converting enzyme, 

20 respectively. Altogether, these physicochemical, biochemical, and immunological properties 
unambiguously identify endopeptidase 3.424.16 as the protein encoded by the isolated cDNA 
clone. PMID: 7592986, UI: 96070836 A human genomic clone encompassing exons 1 -3 of the 
neurotensin/neuromedin N gene was identified using a canine neurotensin complementary DNA 
probe. Sequence comparisons revealed that the 120-amino acid portion of the precursor sequence 

25 encoded by exons 1-3 is 89% identical to previously determined cow and dog sequences and that 
the proximal 250 bp of 5' flanking sequences are strikingly conserved between rat and human. 
The 5* flanking sequence contains cis-regulatory sites required for the induction of 
neurotensin/neuromedin N gene expression in PCI 2 cells, including API sites and two cyclic 
adenosme-5'~monophosphate response elements. Oligonucleotide probes based on the human 

30 sequence were used to examine the distribution of neurotensin/neuromedin N messenger RNA in 
the ventral mesencephalon of schizophrenics and age- and sex-matched controls. 
Neurotensin/neuromedin N messenger RNA was observed hi ventral mesencephalic cells some 
of which also contained melanin pigment or tyrosine hydroxylase messenger RNA. Neurons 
expressing neurotensin/neuromedin N messenger RNA were observed in the ventral 
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mesencephalon of both schizophrenic and non-schizophrenic humans. PMID: 1436492, UI: 
93063S58 Neurotensin is a small neuropeptide of 13 amino acids that may function as a 
aeiuotransmitter or neuromodulator in the central nervous system. In the CNS, neurotensin is 
localized to the catecholamine-containing neurons. A catecholanaine-producing cell line can also 

5 produce NT. lithium salts, widely used in the treatment of manic-depressive patients, 

dramatically potentiate NT gene expression in this cell line. Gerhard et al. (1989) used a canine 
cDNA as a probe on a somatic cefl hybrid panel to determine that the human gene is located on 
chromosome 12. Ihe tridecapeptide neurotensin (162650) is widely distributed in various 
. regions of the brain and in peripheral tissues. In the brain, neurotensin acts as a neuromodulator, 

1 0 in particular of dopamine transmission in the nigrostriatal and mesocorticolimbic systems, 
suggesting its possible implication in dopamine-associated behavioral neurodegenerative and 
Fieuropsychialric disorders. lis various ellbcls are mediated by specific membrane receptois. Vita 
et al. (1993) isolated a cDNA encoding the human neurotensin receptor and showed that it 
predicts a 418-amino acid protein that shares 84% homology with the rat protein. Le et aL (1997) 

15 also cloned the human neurotensin receptor (NTR) cDNA and its genomic DNA. The gene is 
encoded by 4 exons spanning more than 10 kb. The authors identified a highly polymorphic 
tetranucleotide repeat approximately 3 kb from the gene. Southern blot analysis revealed that the 
NTR gene is present in the human genome as a single-copy gene. Le et al. (1997) stated that the 
neurotensin receptor has 7 transmembrane spanning regions and high homology to other 

20 receptors that couple to G proteins 

Neurolysin is expressed ubiquitously in the rat brain (Massarelli et al Brain Res 1999 
Dec 18; 851(1-2): 261-5; Dauch ei al. I Neurochem 1992 Nov; 59(5): 1862-7). It has brcn 
suggested that this enzyme plays a role in die regulation of neurologically active peptides 
(Vincant et ah Br J Pharmacol 1997 Jun; 1 21 (4): 705-10) and activity differs depending on the 

25 cellular source of this enzyme whether it is expressed m primary cultured neurons and astrocytes 
(Vincent et al. J Neurosci 1996 Aug 15; 16(16): 5049-59). This might play a role in nociception 
and signal transduction in the brain as well as central nervous system. Related endopeptidases 
have been shown to play a role in processing angiotensin and important regulator of blood 
pressure. 

30 Uses of the Compositions of the Invention 

The expression pattern, map location and protein similarity information foT MOL9 

suggest that it may function as neurolysin family. Therefore, the nucleic acids and proteins of 

the invention are useful in potential therapeutic applications implicated, for example but not 

limited to, hi various pathologies /disorders as described below and/or other 

35 pathologies/disorders. Potential therapeutic uses for the invention(s) are, for example but not 
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limited to, the following: (i) Protein therapeutic, (ii) small molecule drug target, (iii) antibody 
target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic and/or 
prognostic marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) 
tifisuc regeneration in vitro and in vivo (regeneration for all these tissues and cell types 
5 composing these tissues and cell types derived from these tissues).These may also function m 
extracellular matrix remodeling in tissues described above. 

The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in various diseases and disorders described below and/or other 
pathologies and disorders. For example, but not limited to, a cDNA encoding the neurolysm - 

10 like protein may be useful in gene therapy, and the neurolysm -like protein may be useful when 
administered to a subject in need thereof. By way of nonlimiting example, the compositions of 
the present invention will have efficacy for treatment of patients suffering from Cancer, Trauma, 
ViraLtoacteriaVparasitic infections, Cardiomyopathy, Atherosclerosis, Hypertension, Congenital 
heart defects, Aortic stenosis, Atrial septal defect (ASD), Atrioventricular (A-V) canal defect, 

15 Ductus arteriosus, Pulmonary stenosis, Subaortic stenosis, Ventricular septal defeat (VSD), valve 
diseases, Tuberous sclerosis, Scleroderma, Obesity, Transplantation, Atherosclerosis, 
Aneurysms, Hypertension, Fibromuscular dysplasia, Stroke, Scleroderma, Fertility, Diabetes. 
Von nippel-Lindau (VEIL) syndrome , Pancreatitis, Hirschsprung's disease , Crohn's Disease, 
Appendicitis, Alzheimer's disease, Stroke, Hypercalcemia, Parkinson's disease, Huntington's 

20 disease, Cerebral palsy, Epilepsy, Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia- 
telangiectasia, Leukodystrophies, Behavioral disorders, Addiction, Anxiety, Pain, Systemic 
lupus erythematosus , Autoimmune disease, Asthma, Emphysema, Scleroderma, Autoimmune 
disease, Renal artery stenosis, Interstitial nephritis, Glomerulonephritis, Polycystic kidney 
disease, Renal tubular acidosis, IgA nephropathy, Hypercalcemia, Lesch-Nyhan syndrome. The 

25 novel nucleic acid encoding the neuTolysin-like protein, and the neurolysin -like protein of the 
invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. These materials are 
farther usefiil in the generation of antibodies that bind immunospecifically to the novel 
substances of the invention for use in therapeutic or diagnostic methods 

30 

MOL10 

MOLlOa 

A novel nucleic acid encoding a protein bearing sequence similarity to Cyclic- 
Nucleotide-Gated Olfactory Channel -like protein was identified by TblastN using CuraGen 
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Corporation's sequence file for MOL10 probe or homoiog, run against the Genomic Daily Files 
made available by GenBank. The nucleic acid was farther predicted by the program GenScan™, 
including selection of exam. Those were further modified by means of similarities using 
BLAST searches. The sequences were then manually corrected for apparent inconsistencies, 
thereby obtaining the sequences encoding the full-length protein. The disclosed novel MOLlOa 
nucleic acid of I S35 nucleotides (also referred to as GM98960647_A) is shown in Table 10A. 
An open reading frame begins with an ATG initiation codon at nucleotides 54-56 and ends with 
a TGA codon at nucleotides 1788-1 790. A putative untranslated region upstream from the 
initiation codon and downstream from the termination codon are underlined in Table 10A, and 
the start and stop cudora are in bold letters. 

The nucleic acid sequence has 1536 of 1733 bases (88%) identical to uRattus norvegicus 
Cyclic-Nucleotide-Gated Olfactory Channel ocnc2 mRNA (GENBANK-ID: U12623) {Expect - 
5.2e :08 ). 



Table 10A. MOLlOa Nucleotide Sequence (SEQ ID NO:27) 

T AC AG GGAG AGAGGGT GT GGACAT CT C^C ACGCC AGC ACC AGACCAC AGAACC ATGAGCCAGGACACCAAA 
GTGA AGAC A ACAKAGTC CAG7 CCC2CAGCCCCAT CCAAGG C CAGGAGGAAGTT GCTG CCTGTCCT G GACCC 
fiXGTGGGGftt TAClAC T ACTGGTG'SC TGAACACAAT GGTC T TCCC AGT CATGT ATAACCTGAT CAT C CT CG 
'f GCG CAGUoCCT GCTT C C CCGACTTG C^GC^ 1 CGGrl^^lVi , GG^^CCTGGTTGGT•GC TGGACT AGACSAGT 
G ACC TGCT AT ACCT ACT A GACATG2T GGTG CGCT TCCACACAGGTGGATTCTT GGAACAGGGCAT CC r GGT 
GG'_*G G ACAAGWrAGGATCTCGAG rCGCT ACGTT CGCACC T GGAGTT T CT TCT ? GGACCTGGC TTCCCTGA 
TGCCCACAGATGTGGT CT AGG1 (JG JGLTi'GCi GCGG GCACAC AUUJACCCTGAGGCTGAACCGCT TTCTCCGC 
GCGCCCOSCCTCTTCGA^C/^^ 

C AAGC TGAT<GCTTTACATTT TT GT CGTCA1 CC A T ?G 3 A AC AGCTGCCT ATACT TT 3CCCT AT CCC GGTAC C 
TGGGCTTCGGGC£I(^CGCATSGGTGTJ\^ 

T ACCTCTATAGCT T TT AC T T C T CCACGCT GAT ACTGAC TAC AGTGGGC GA'JACACCGCCGC CAGC CAG3 GA 
AGAAGAGT AC CTC TTCAT GGT GGGCG ACT TCCT GCT S GCC GTCAT GGGTT^CGCCACCAT CATGGGT AGC A 
T GAGCT CT GT CAT C T AC AACAT GAAC ACT GCAGATGC GGC T TTCT ACC CAGAT CATGCACT GGT GAAGAAG 
T ACAT*GAAGCTGCAG CAC G T C AACCGCAAGCT GGAG C G GCGAGTT ATT GACT GGT 1 ATCAGC ACCT GCAGAT 
C AACAAGAAGATGACCAACGAG G T AGCCATCT T ACAS CACTTGCCTGAGCGGCTGCGGGCAGAAG TGGCTZ 
TGTCrGTGCACCTSTCCACTCTGAGCC^ 

CTGGT GCT GAAGCTGCAGCCCCAGACOTACTCACCAGGT GAATAT GTATGCCG CAAAGGAGACATT GGC CA 
AGAGATGT AC ATC ATCCGAGAG GGI CAAC T GGCCGTGGT GGC AGATGAT GGTATC ACACAG T AT GC?GTGZ 
T CGGTGGAGGGCT CTACT "?T GG GGAGAT CAGC AT CAT C AACATCAAAG G TGG GAACATGT CT GG G AACCGC 
C EC AC AGC CAACATCAAGAGCCTA GGTT AT TCAG ACC T A T T CTGC CT G AGC AAGGAGGAC CT GC GGGAGGC 
3 CITGAGCX^GTAICCACAAGC ACASA_CCAI CAT GGAGGAGAAAGGACGT GAGATC CTGCTGAAAAT GAACA 
AGTT GGAC GTGAATGC T GAC GC AG CT GAGATC G C CCT GCAGGAGGCCAC AGAGTC CCGGCT ACGAGG CCTA 
GACX^GCAGCTGC-ATGATCT ACAGACCAAGTT T GCTCGCCTCCTG GCTGAGCT GGAGTCCAGCG CACT T AA 
G ATTGCTT ACCGCATT G AAC GGCT 3GAGT GGC AGACT CG&GAGTG GCC AATGCCC ^AGGACCT G GCCGAGG 
CTGATGAC GAGGG TGAGCCT GAGGAGGGAACT r C CAAAGAT GAAGAGGGCAGGGC CAGCCAG GAGGGACCC 
CCAC-GTCCAG AGTGACCCC AT CC CCATCCCCACGAT TC C£ ACCTCC^AGTCAATCCAGAG 



The MOLlOa protein encoded by SBQ ID NO:27 has 638 amino acid residues, and is 
presented using the one-letter code in Tabic 10B (SBQ ID NO:28). PSORT analysis predicts the 
protein of the invention to be localized m the plasma membrane with a certainty of 0. 6000. 
Using the SIGNALP analysis, it is predicted that the protein of the invention has a signal peptide 
with mosi likely cleavage site between positions 57 and 58. 
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Table 10B. Encoded MOT>l0a protein sequence (SEQ ID NO:28) 

MSQDT KVKTTE S3 P PAPS K^RKLLPVLD PSGDYYYlfWLN IMVFP VMYNL 1 1 LVCRAC FPD LCHGY IVAWL 
VLDYTS DLLYIiDMVl^FHTGSFl^K^-LWDKGRISSRYVRTWS FFLDIASLMPTDVVYVRLGPHTPTLF 
I»NR1 "L RAPRIiFEAFDRT E T RTAY PN AFRI AKLNLY I FV V I HNNSCiY FAX'S RYi>G i* tJRDAW V Y PDP AQP G? 
E RLRR QYL Y5 JYFS TLX LT T VSD? PP ? AR RER Y J iFMVG 15 Fiji AVMGFAT IMGS MS S V I YNMN7 ADAAFYPD 
H&LVKKYMKLQHVNRKLE RRV1 DWYQHLQ IN KKMTNEVAI LQHL P E RLRAEVAVS VHL ST L S RVQI JON CE 
A5LLE ECjVLKLCPQT YS PGEY VCRKGDIGQEN YI IRE GCLAWADDG J T QYAVLGAGL YFGE I S I IN IKGG 
NXSGN RRTANI KSLG YSDT ■ FC7 1 S KEDLREVLS EY r QAQT IME EKSREI LLKMN KLDVK AE A AE I Ai.CE ATE 
S RLRGL D02t>DDLQT KFARIiAELESSALKIAYRI ERL^JQTREK PM P E DZi A2ADBEG EP E E GTS K EGR 
A5QEGPFGPE 

The full amino acid sequence of the protein of the invention was found to have 1 068 of 
1649 amino acid residues (64%) identical to, and 1068 of 1649 residues (64%) positive with, the 
575 amino acid residue Cyclic-Nuclcotide-Gated Olfactory Channel ocnc2 subunit protein from 
5 Rattus norvegicus (ptnnSPTREMBL-ACC: Q64359) (E value = 5.50- 54 ), and .292 of 556 amino 
acid residues (52%) identical to, and 404 of 556 residues (72%) positive with, the 694 amino 
acid residue Cone Photoreceptor cGMP-Gated Channel Alpha Subunit Homo sapiens (Human) 
(ptnn TREMBLNEW -ACC: AAC17440) (E value - 5.8e- 157 ) 

MOLlOb 

10 In the present invention, the target sequence identified previously, MOLlOa Accession 

Number GM98960647_A (also known as CG54557-01), was subjected to the exon linking 
process to confirm the sequence. PCX primers were designed by starting at the most upstream 
sequence available, for the forward primer, and at the most downstream sequence available for 
the reverse primer. In each case, the sequence was examined, walking inward from the 

1 5 respective termini toward the coding sequence, until a suitable sequence that is either unique or 
highly selective was encountered, or, in the case of Hie reverse primer, until the stop codon was 
reached. Such primers were designed based on m silico predictions for the fall length cDNA, 
part (one or more exons) of the DNA or protein sequence o f the target sequence, or by translated 
homology of the predicted exons to closely related human sequences sequences Sum other 

20 species. These primers were then employed in PCR amplification based on the following pool of 
human cDNAs: adrenal gland, bone marrow, brain - amygdala, brain - cerebellum, brain - 
hippocampus, brain - substantia nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, 
fetal liver, total lung, heart, kidney, l5Tnphonw - Raji, mammary gland, pancreas, pituitary gland, 
placenta, prostate, salivary gland, skeletal muscle, small intestine, sphrnl cord, spleen, stomach, 

25 testis., thyroid, trachea, uterus. Usually the resulting amplicons were gel purified, cloned and 

sequenced to high redundancy. The resulting sequences from all clones were assembled with 

themselves, wilh other fragment!? in CunaGen Corporation's database and with public RSTs. 

Fragments and ESTs were included as components for an assembly when the exlent of their 

identity with another component of the assembly was at least 95% over 50 bp. In addition, 
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sequence traces were evaluated manually and edited for corrections if appropriate. These 
procedures provide the sequence reported below, which is designated MOLlOb (Accession 
Number CG54557-02). This differs from the previously identified sequence [GM98960647_A 
(also known as CG54557-01)] at aminoacid position 159T->Aandhas deletions at positions 22 
5 R,93Gaml426G. 

The disclosed novel MOLlOb nucleic acid of 2551 nucleotides (also referred to as 
CG54557-02) is shown in Table 9D. An open reading frame begins with an ATG initiation 
codon at nucleotides 779-781 and ends with a TGA codon at nucleotides 2504-2506. A putative 
untranslated region upstream from the initiation codon and downstream from the termination 
10 codon are underlined in Table 1 OA, and (he start and stop codons are in bold letters. 

The nucleic acid sequence has 11625 of 1857 bases (87%) identical to a gb:GENBANK- 
ID:RNU12425|acc:U12425.1 mRNA from Rattus norvegicus (Rattus norvegicus olfactory cyclic 
nucleotidc-gatcd channel mRNA, complete cds) (Expect = 4.0e" 316 ). 



Table 10D. MOLlOb Nucleotide Sequence (SEQ ID NO:29) 

GT TTTTGTT STTTT GAT ATAGG A3ATATTGAAG C AGG TTCACAAAAAGAGA AA AGTT G A AAGA? TGGGGAC ' 
C ATAAAAC AC APG 5AAT GGCT GGT AGGA.TC AG^C ACT AGAAGTCACAAG A AGG RTATG AGG AC A A A A QCAC 
C ATAG GATC S CCC C TAT CACACTACCTAT GAGAAT GG T GTGATGGGGG AA G GC GTAT G TGGAGGT AGAXAA 
; GGGTAGGAA3TAGGT T ACAAAAAT AGAGCT CAC T T CTC ATGTG AGAGGC AT CT CTC'TGTCC CT GG AGAATA 
■ GT TT AGC AC Z TGACA? AGA? AA3CCATTC AC-T AAT AG T T GTTAAATAAAT A AA TAGT G AGG CC CA £ ATAGA 
. AS TT SCAAASAL'AAAAC AGAGT 5 TTTGATCCT ACACT AAAAC TGAGGT C T T CT GACCCAGAEGACACCTAT 
GTAGCTCAG J TGC T GIHSGAAAG AGGGGAG GAGGAAAACAGAGACAAGAC V CAGGCTTCCCT CT GAGGCATG : 
CAC CCC C ACCTTC TCCAG GGAT CTCAT TAG AGGT G*T T TAGCT GGGCAGGT GT AAG CCC AGGCCCT GGGAGA 
CAGGG CAGAGTGCTAGAGCTASACTGTCTC CACCCCTTC AGT AGC GCTAGCTCTGGTT GTGTTGC TAAGAG 
CC CCAAAGAGAAAGAAGT CACAGCAGAAGC CCAAC AGC AGCCTCC T TCAGGCAGT CAG GCAC IAGTGCCCA 
ACTCCAGAAGTCCCCTACAGGCAiGAG^GGTGTGGACATCTCACACCCC^ 
ZwSCCAGCACACCAAAC^CAAGACAACAGAGTCCAGTCCCCCAGCCCC^ 

TGT CC TGGACCCATCT GGGGATTACT ACTAC TCG TGGCTGAACACAACGGTCTTCCCAGTC ATGT ATAACC 
TC ATCflTCCTCGTGTGCAGAGCCTGC TTCCCCGACHT 5CAGCACG GT CATCTGCTGGC CTGGTTG STGCTG 
GACT AC AC GAGTGACC TG C TAT ACX^f ACT AGACATCG TGGTG CGC T T CCAUVCAGGAT TCTT TGGAACAGGG 
CATCCT GGTGGTG G AC AAGGGT AGGATCT CGAGT CGC T ACGT T CGCACCT GGAGTTT C T TCTT GG ACCTGG 
ClTCCCTGATGCXCACAGATGTGGTCTACGTGCGGCTG^ 
' T 1 TCT CCCCGCCC CCCC CC7CT T CGAGGC CTTCGACC GCGCAGAG AC CC GC ACAGCT T ACCC AAAT GCCTT 
• T CGC ATTGCCAAGC T GAT GCTTT ACATTT T 1 GTCGTCATCC AT TGGAACAG CT GCCTATACT T TGCCCT AT 
. C CCGGTACCT GGGC T CX? GGGCGT SACGCAT GGGT GT AC CCGGACCCCGCGCAGCCTGGCT^ T GAGC GCCTG 
i C GGC SCCAGT ACC T C r J AT AGCT r TTACTT CTCCACGC TGATAC TGACT AC AGTGGGCGATACACCG CCGCC ! 
AGCCAGGG AAGAA3 AGT ACCTCTTCATGGTGGGC GAC TT CC7PGCT GGCCG t CATGGGT TT CGCCAC CATCA : 
T GGGT AGCAT GAG C T CTGTCATC I ACAAC ATGAACAC TGCAGAT GCGGCT TTCTACCCAGATCAT G CACTG 
GT GAAGAAGI ACAI GAAG^jGC AGCACGTCAAC C G CAAG CTGGAG CGGCG AGT T ATTGAC"GGT ATCAGCA ■ 
CCTCCAGATCAACAAGAAGATGACCAACG^ 

AAGT GGCTSTGTC TGTGCACCT GTCC ACT CTGAGCCGGGTGC AGATCTT T CAGAACT G T GAGGCC AGCCTG 
CTGGAGGAGCTGGT GCT G AAGCTGCAG CCCCAGACCT ACTCACCAG G TX3 AATATGT ATG CC6CAAA£GAGA 
C ATTGGCC AAGAGATGT ACATC ATCCGAGAGGGT CAACT GGCCGTGGTGGC AG ATGAT GGT AT CACACAGT 
ATGCT GTGCTCGGT GCAGGGCTCTACTTT GGGGAGAT CAGCAI CATC AACATC AAAGGGAACAT GTCTGGG 
TiACC G C CGCACAG CC AACAT CAAG&GCCT AGGT T ATT CAGACCTATT CT G C CT G AGCAAGGAGG AC CTGCG 
3G AGGTGCTG AGC GAGT ATCCACAAGCACAGACCATCAT GGAG GAGAAAG GACGTGAGATCCTGCTGAAAA 
T GAACAAGT T GGACGTGAAT GCTGAGGCAGCTGAGATCGCCCT GCAGG AG G CCACAGAGTCCC GGCT ACGA 
G GCC T AGACCAGC AGCT GGATGAlt^TACAGACCAAG'r TTGCTC GCCTCCf G GCTGAGCTGGAGT CCAGCGC 
ACT^A AG AT TGCT T ACCGCATT GAACGGCTCtGAGT GGCAjGACT CGAGAGTGGC CAATGCCCGAGGACCTGG 
CT<3AGGCTX*ATGACGAGGGTGAGCCTGAGGAGGG^ , 
GGACCCCCAGGTCCAGAGTGACCO^^^ j 

15 ' ' " ~ " ~~ 
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The MOL9a protein encoded by SEQ ID NO:29 has 575 amino acid residues, and is 
presented using the one-letter code in Table 10B (SEQ ID NO:30). PSORT analysis predicts the 
protein of the invention to be localized in the plasma membrane with a certainty of 0. 6000. 
Using the SIGN ALP analysis, it is predicted that the protein of the invention has a signal peptide 
5 with most likely cleavage site between positions 56 and 57 (CRA-CF) 



Table 10E. Encoded MOLlOb protein sequence (SEQ ID NO30) 

LDYTS DLLYLLEMVVRFHTG7L3 QGI L W DKGRI S S R YVRTWSF FX DLAS LMP T DWYVRIiG PET P TLRLN 
RFLRAPRLFEAEDRAE7RTAY PNAFR IAKLMLY I FW 1 CLY ?AL SRYLS 7GRDAWVY P D PAQ ?GFER 
LRRQYL Y S FY FST L I L?T VGDT PPPA"HEE E YX FMVGDFLIAVMG FATI MGSMS S VI YKMNT ADAAFY P DHA 
LVKKYKKLQI IVNRKLERRV I DW YQHLQIK KE&OT EVTtI LQHL PEtlLRAEVAVS VHLST LSRVQ Z FQNCEAS 
LL EE LVLKLQ PQT YS PGE YVCRKGD1GQEMY 1 1 REGQLAWAE CGI TQYAVLGASLYF GE I S " I N I XGNMS 
GN RRT AN IKSLGYSDLFCLS KEDLREVLS E Y PQAQT I M2EKGRE I LLKMN KLD VNAEAAE IALQEATE S RL 
RG LDQQ1 DDL0TKFARLLAELE3 S ALK IAi'R IE RLEWQrRER EMP E DLAE ADDE3EPIL EGT S XDEE GRASC 
EG?PGP£ . 

The mil amino acid sequence of the protein of the invention was found to have 536 of 
575 amino acid residues (93%) identical to, and 552 of 575 ammo acid residues (96%) similar to, 
10 the 575 amino acid residue ptnr:SWISSrROT-ACC:Q64359 protein from Rattus norvegicus 
(Rat) (CYCLIC-NUCLROTTDE-GATED OLFACTORY CHANNEL OCNC2 SUBUNIT) (E 
value = 4Je- 2S7 ) 

Chromosomal information: 

The Cyclic-nucleotide gated olfactoiy channel ocnc2 disclosed in this invention maps to 
1 5 chromosome I 1 . This information was assigned using OMIM, the electronic northern 
bioinformatic tool implemented by CuraGen Corporation, public ESTs, public literature 
references and/or genomic clone homologies. This was executed to derive the chromosomal 
mapping of the SeqCallmg assemblies, Genomic clones, literature references and/or EST 
sequences that were included in the invention. 

20 Tissue expression 

The Cychc-nucleotide gated olfactory channel ocnc2 disclosed in this invention is 
expressed in at least the following tissues: Adrenal gland, bone marrow, brain - amygdala, brain 
- cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, brain -whole, fetal 
brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary gland. 
25 pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, small intestine, 
spinal cord, spleen, stomach, testis, thyroid, trachea, uterus. . This ^formation was derived by 
determining the tissue sources of the sequences that were included in the invention including but 
not limited to SeqCalling sources. Public EST sources, and/or RACE sources. 
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15 



In addition, the sequence is predicted to be expressed in the following tissues because of 
the expression pattern of (GENBANK-LD: ^5:GENBANK-ID:KNU12425jacc:U12425.1)a 
closely related Rattus norvegicus olfactory cyclic nucleotide-gated channel mRNA, complete cds 
honiolog m species Rattus norvegicus ; olfactory neuroepxthelium. 

These materials are farther useful in the generation of antibodies that bind immuno- 
specifically to the novel MOLlOb substances for use in therapeutic or diagnostic methods. 
These antibodies may be generated according to methods known in the art, using prediction from 
hydrophobicity charts, as described in the "Anti-MOLX Antibodies" section below. For 
example the disclosed MOLlOb protein has multiple hydrophilic regions, each of which can be 
used as an immunogen. In one embodiment, a contemplated MOLlOb epitope is from about 
ammo acids 25 to 75. In another embodiment, a MOLlOb epitope is from about amino acids I to 
30. In further embodiments, MOLlOb epitopes are found in amino acids 150-250, 275-350, 375- 
400. and 425-560. These novel proteins can also be used to develop assay system for functional 



Homology between the MOL10 isoforms and other homologous proteins is presented 
graphically m the multiple sequence alignment given in Tabic 9T (with MOLlOa being shown on 
line 1 and MOLlOb on line 2) as a ClustalW analysis comparing MOL10 wilh related protein 
sequences. 



20 Table 10C. Information for the ClustalW proteins: 

1) MOLlOa (SEQ ID NO: 28 J 

2) MOLlOb (SSQ ID NO: 30) 

3) S3S691 cyclic nuclectide-cated channel protein - rabbit tSZQ ID NO: 73) 

4) Q6435S Cyclic-tftacleotide-Gated Olfactcry Channel ocnc2 subunit protein from 
25 Rattus norvegicus (SEQ ID NO: 74) 

5) AAC17440 cone Photoreceptor cGHF-Gatcd Channel Alpha Subunit Homo sapiens 
(SEQ ID NO: 75) 



WCLlOa Pro 
MOLlOb Pro 
S35S91 
Q64359 
AAO.7440 



MOfclOa Pro 
MOLlOb Pro 
S35691 
QS4359 
AACT7440 



MOLlOa Pro 
NDLlOb Pro 
635691 
0.64359 
AAC17440 



■I- 



10 



20 



.1. 



30 
-1 - 



40 



■ I- 



50 



60 



.1. 



MS3WR3CARA EL3GSAW31R3 AATRRGRRCTi KTKRKRH3SG KGTPMQSTQ2 STRRRAQTPC 
MAKIHTOXSH E-S LKVKJSDRDL NRAEKGL5RA KSS SEE! S3VLQPOIAM 



1 
1 

50 
1 

52 



70 



30 



90 
.1 . 



100 
.1 .... I 



..I. 



113 
• . k . 



120 
...I 



ESTGHTCTHMT KKSNGVK3SP &HNHNWHVPA TIKAHSKDES FiTRSR-PQSA. AJ30C-BSELC 

ETRG LA DSGQeSETGQ GIARLSRLIa ICRRBAARHV K^^QGFLteb' ^ORFRGASLK 

180 



11H 

5 

103 



■ I ■ 



130 
■ I • 



140 



150 
.♦I . 



KVKT?E5sPE 
KVKl'VilS 1 ¥k 



EdiAEMpA PQQ RRGGFRRIVR LVGVIRQWAN RN5HEEEARP DSFj 
jEflS^Q&AQ AN VGSOeBtvCHG K£AW| 
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MOUOa Pro 
MDLlOb Pro 
S3S691 
Q64359 
A&C17440 



MDLlOa. Pro 
KJLlOb Pro 
835691 
Q64359 




DGKGDKDGD3 KSTKKEFEiF 
NKTEEE KKTKKK-DAI 




310 



320 
■ I 



330 



340 
.1 . 



350 



360 
..I 




490 
..I . 



500 



510 



52 C 
..I • 



530 
■ I 



E40 
.1 



MDLlOa Pro 
MOLlOb Pro 
S33691 
; Q64359 
»ftCl74«0 



MOLlOa Pro 
MOIXOo Pxo 
835691 
Q64359 
A&C17440 



dkyqklc'isk 
d^yqhlqiSk 



Di'JYQHLC'IEK 




710 720 

I ....I.. ..l 

" ' 54 f 
543 

VKMKQgTEPD 715 
543 

GGDIC 575 



KOI4IOB Pro 
MOLlOb Pro 
£35691 



730 

.1 



740 
.1 



750 
.1 



5DGMKSP 
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AAC1744D 




1(3 57 5 

— 594 
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Cyclic nucleolide-gated (CNG) channels play central roles in visual and olfactory signal 
transduction. In the retina, rod photoreceptors express the subunits CNCalphal and CNCbetala. 
In cone photoreceptors, only CNCalpha2 expression has been demonstrated so far. Rat olfactory 
sensory neurons (OSNs) express two homologous subunits, here designated CNCalpha3 and 
CNCalpha4. This paper describes the characterization of CNCbetalb, a third subunit expressed 
in OSNs and establishes it as a component of the native channel. CNCbetalb is an alternate 
splice form of the rod photoreceptor CNCbetala subunit. Analysis of mRNA and protein 
expression together suggest co-expression of all three subunits in sensory cilia of OSNs. From 
single-channel analyses of native rat olfactory channels and of channels expressed 
heterologous^ from all possible combinations of the CNCalpha3, -alpha4, and -beta lb subunits, 
we conclude that the native CNG channel in OSNs is composed of all three subunits. Thus, CNG 
channels in both rod photoreceptors and olfactory sensory neurons result from coassembry of 
specific alpha subunits with various forms of an alternatively spliced beta subunit 
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Phototransduction is mediated by an enzymatic cascade that ultimately leads to the 
hydrolysis of cGMP. The photoreceptor cells, rods and cones, integrate and respond to cCMP 
hydrolysis via a cGMP-gated cation channel in the plasma membrane of the outer segment 
Kaupp et al. (1 989) cloned this channel from bovine retina. Dhallan et a1 . (1 991 ) used the bovine 
5 sequence to isolate cDNA and genomic DNA encompassing the entire protein coding region of 
the human homolog. Assignment to chromosome 4 was achieved by study of somatic ceil 
hybrids. Pitder et al. (1 992) determined the primary srmcmres of the human and mouse retinal 
rod cGMP-gated cation channel by analysis of cDNA clones and amplified DNA The open 
reading frames predicted polypeptides of 690 and 683 residues, respectively, exhibiting 88% 

10 sequence similarity. Significant sequence similarity (59%) of the visual cGMP-gated channel to 
the olfactory cAMP-gated channel was pointed out The RNA transcript was found to be 3.2 kb 
long in human, mouse, and dog. By PCRused in connection with somatic cell hybrid DNAs, 
Pittler et al. (1992) mapped the CNCG gene to 4pl4-ql3 near the centromere. By interspecific 
backeross haplolype analysis, (he corresponding gene in (he mouse, Cncg, was mapped to a site 

15 0.9 cM proximal to the Kit locus on chromosome 5. Griffin et at (1 993) mapped the CNCG 1 
gene to 4pl2-cen by fluorescence in situ hybridization It is noteworthy that the rod cGMP PDE 
beta polypeptide (PDEB; 180072) also maps to 4p, at 4pl63. 
Uses of the Compositions of the invention 

The protein similarity information, expression pattern, and map location for MOL1 0 

20 suggest that it may have important structural and/or physiological functions characteristic of the 
Cyclic-nucleotide gated channel family. Therefore, the nucleic acids and proteins of the 
invention are useful in potential diagnostic and therapeutic applications and as a research tool* 
These include serving as a specific or selective nucleic acid or protein diagnostic and/or 
prognostic marker, wherein the presence or amount of Hie nucleic acid or the protein are to be 

25 assessed, as well as potential therapeutic applications such as the following: (i) a protein 

therapeutic, (ii) a small molecule drug target, (iii) an antibody target (therapeutic, diagnostic, 
drug targeting'cytotoxic antibody), (iv) a nucleic acid useful in gene therapy (gene delivery/gene 
ablation), and (v) a composition promoting tissue regeneration in vitro and in vivo (vi) biological 
defense weapon, 

30 The nucleic acids and proteins of the invention are useful in potential diagnostic and 

therapeutic applications implicated in various diseases and disorders described below and/or 
other pathologies. For example, the compositions of the present invention will have efficacy for 
treatment of patients suffering itom: 
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colorblindness, CNS developmental disorders and other diseases* disorders and 
conditions of the like. 

These materials are further useful in the generation of antibodies that bind 
iimnunospecifically to the novel substances of the invention for use in therapeutic or diagnostic 
5 methods. 

A summary of the MOLX nucleic acids and proteins of the invention is provided in Table 

11. 



TABLE 11: Summary Of Nucleic Acids And Proteins Of The Invention 



Name 


Tables 


Clone; Description of Homolog 


Nucleic 
Add 

SEQID 
NO 


Amino 
Add 

SEQID 
NO 


MOL1 


1A, IB, 


MOL1: GM_79960178 


1 


2 


MOL2 


2A..2B 


MOL2: 20466828_EXT1 


3 


4 


MOL3 


3 A, 3B 


MOL3: 82254077.0.1 


5 


6 


MOL4 


4A, 4B. 


MOL4: AC004826 


7 


8 


MOL5 


5A, 5B ; 


MOL5: AC025535 


9 


10 


MOL6 


6A.6B 


MOL6a: GM_87760758_A 


11 


12 




6D.6F. 


MOT^b: GM_877607S8_A_da 


13 


14 


MOL7 


7A.7B 


MOL7: 30675745.0.499 


15 


16 


MOL8 


8A, SB 


MOL8a: 11800699-0-16 


17 


18 




8D.8E 


MOL8b: CG56222-01 


19 


20 


MOL9 


9A,9B 


MOL9a: 195067 19_B_EXT 


21 


22 




9D,9E 


MOL9b: 1 95067 19_B„EXI-S773 


23 


24 ! 




9F, 9G 


MOL9c: CG56222-01 


25 


26 


MOL10 


10A, 10B 


MOL1 0a GM98960647_A 


27 


28 j 




10D, 10E 


MOL10bCG54557-02 


29 


30 | 



10 MOLX Nucleic Acids and Polypeptides 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
MOLX polypeptides or biologically active portions thereof. Also included in fee invention are 
nucleic acid fragments sufficient for use as hybridization probes to identify MOLX-cncoding 
nucleic acids (&g. 9 MOLX mRNAs) and fragments for use as PCR primers for the amplification 
1 5 and/or mutation of MOLX nucleic acid molecules. As used herein, the term Nucleic acid 
molecule" is intended to include DNA molecules (e.g., cDNA or genomic DNA), RNA 
molecules {e.g. y mRNA), analogs of the DNA or RMA generated using nucleotide analogs, and 
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derivatives, fragments and homologs thereof. The nucleic acid molecule may be single-stranded 
or doubk-stianded, but preferably is comprised double-stranded DNA. 

An MOLX nucleic acid can encode a mature MOLX polypeptide. As used herein, a 
"mature" form of a polypeptide or protein disclosed in the present invention is the product of a 
5 naturally occurring polypeptide or precursor form or proprotein. The naturally occurring 
polypeptide, precursor or proprotein includes, by way of ncmlimiting example, the full-length 
gene product, encoded by the corresponding gene. Alternatively, it may be defined as the 
polypeptide, precursor or proprotein encoded by an ORF described herein. The product 
"mature" form arises, again by way of nonlimiting example, as a result of one or more naturally 

1 0 occurring processing steps as they may take place within the cell, or host cell, in which the gene 
product arises. Examples of such processing steps leading to a "mature" form of a polypeptide 
or protein include the cleavage of the N-terminal methionine residue encoded by the initiation 
codon of an ORF, or the proteolytic cleavage of a signal peptide or leader sequence. Thus a 
mature form arising from a precursor polypeptide or protein that has residues 1 to N, where 

1 5 residue 1 is the N-ternrina1 methionine, would have residues 2 through N remaining after 

removal of the N-terminal methionine. Alternatively, a mature form arising from a precursor 
polypeptide or protein having residues 1 to N, in which an N-terminal signal sequence from 
residue 1 to residue M is cleaved, would have the residues from residue M+l to residue N 
remaining. Further as used herein, a "mature" fcrai of a polypeptide or protein may arise from a 

20 step of post-translational modification other than a proteolytic cleavage event. Such additional 
processes include, by way of non-limiting example, glycosylation, myristoylation or 
phosphorylation. In general, a mature polypeptide or protein may result from the operation of 
only one of these processes, or a combination of any of them. 

The term "probes" as utilized herein, refers to nucleic acid sequences of variable length, 

25 preferably between at least about 10 nucleotides (nt), 100 nt, or as many as approximately, e.g. , 
6,000 nt, depending upon the specific use. Trobes are used in the detection of identical, similar, 
or complementary nucleic acid sequences, linger length probes are generally obtained from a 
natural or recombinant source, axe highly specific, and much slower to hybridize than shorter- 
length oligomer probes. Probes may be single- or double-stranded and designed to have 

30 specificity in PCR, membrane-based hybridization technologies, or EL1S A-like technologies . 

The term "isolated" nucleic acid molecule, as utilized herein, is one, which is separated 
from other nucleic acid molecules which are present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the nucleic acid 
(i.e., sequences located at the 5 - and 3'-tennini of the nucleic acid) in the genomic DNA of the 
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organism from which the nucleic acid is derived For example, in various embodiments, the 
isolated MOLX nucleic acid molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, I kb, 
0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in 
genomic DNA of the cell/tissue from which the nucleic acid is derived (e.g., brain, heart, liver, 
5 spleen, etc.). Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material or culture medium when produced by recombinant 
techniques, or of chemical precursors or other chemicals when chemically synthesized. 

A nucleic acid molecule of the invention, e.g., a nuclcicacid molecule having the 
nucleotide sequence ofSEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or a 

10 complement of this aforementioned nucleotide sequence, can be isolated using standard 

molecular biology techniques and the sequence information provided herein. Using all or a 
portion of Ac nucleic acid sequence ofSEQ ID NOS:l, 3, 5, 7, 9, 1 L 13, 1 5, 17, 1 9, 21, 23, 25, 
27, and 29 as a hybridization probe, MOLX molecules can be isolated using standard 
hybridization and cloning techniques (e>g„ as described in Sambrook, et al. 3 (eds.), Molecular 

1 5 Cloning: A Laboratory Manual 2 nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1989; and Ausubel, et a/., (eds.), CURRENT Protocols in Molecular BlOT ,ogy, 
John Wiley & Sons, New York, NY, 1993 .) 

A nucleic acid of the invention can be amplified using cDNA, niRNA or alternatively, 
genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR 

20 amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector 
and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to 
MOLX nucleotide sequences can be prepared by standard synthetic techniques, e.g. 9 using an 
automated DNA synthesizer. 

As used herein, tlis term "oligonucleotide" refers to a series of linked nucleotide residues, 

25 which oligonucleotide has a sufficient number of nucleotide bases to be used in a PCR reaction. 
A short oligonucleotide sequence may be based on, or designed from, a genomic or cDNA 
sequence and is used to amplify, confirm, or reveal the presence of an identical, similar or 
complementary DNA or RNA in a particular cell or tissue. Oligonucleotides comprise portions 
of a nucleic acid sequence having about 10 nt, 50 nt, or 1 00 nt in length, preferably about 1 5 tit 

30 to 30 nt in length. In one embodiment of the invention, an oligonucleotide comprising a nucleic 
acid molecule less than 100 nt in length would further comprise at least 6 contiguous nucleotides 
of SEQ ID NOS: 1,3, 5, 7,9, 11, 13, 15, 17, 19,21,23,25, 27, and 29, or a complement thereof. 
Oligonucleotides maybe chemically syntheb-ized and may also be used as probes. 
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In another embodiment, an isolated nucleic acid molecule of the invention comprises a 
nucleic acid molecule that is a complement of the nucleotide sequence shown in SEQ ID NOS: 1 , 
3* 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or a portion of this nucleotide sequence 
a fragment that can be used as a probe or primer or a fragment encoding a biologically-active 
5 portion of an MOLX polypeptide). A nucleic acid molecule (hat is complementary to the 

nucleotide sequence shown in SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 1 9, 21 , 23, 25, 27, and 
29 is one mat is sufficiently complementary to the nucleotide sequence shown in SEQ ID 
NOS:l,3,5,7, 9, 11, 13, 15, 17, 19, 2L 23, 25, 27, and 29 that it can hydrogen bond with little 
or no mismatches to the nucleotide sequence shown SEQ ED NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 

10 2L 23, 25, 27, and 29, thereby forming a stable duplex. 

As used herein, the term "complementary ' refers to Watson-Crick or Hoogsteen base 
pairing between nucleotides units of a nucleic acid molecule, and the term "binding" means the 
physical or chemical interaction between two polypeptides or compounds or associated 
polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, van der 

1 5 Waals, hydrophobic interactions, and the like. A physical interaction can be either direct or 
indirect Indirect interactions may be through or due to the effects of another polypeptide or 
compound. Direct binding refers to interactions that do not take place through, or due to, (he 
effect of another polypeptide or compound, but instead are without other substantial chemical 
intermediates. 

20 Fragments provided herein are defined as sequences of at least 6 (contiguous) nucleic 

acids or at least 4 (contiguous) amino acids, a length sufficient to allow for specific hybridization 
in the case of nucleic acids or for specific recognition of an epitope in the case of amino acids, 
respectively, and are at most some portion less than a full length sequence. Fragments may be 
derived from any contiguous portion of a nucleic acid or amino acid sequence of choice. 

25 Derivatives are nucleic acid sequences or amino acid sequences formed from the native 

compounds either directly or by modification or partial substitution. Analogs are nucleic acid 
sequences or amino acid sequences that have a structure similar to, but not identical to, the native 
compound but differs from it in respect to certain components or side chains. Analogs may be 
synthetic or from a different evolutionary origin and may have a similar or opposite metabolic 

30 activity compared to wild type. Homologs are nucleic acid sequences or amino acid sequences 
of a particular gene that are derived from different species. 

Derivatives and analogs may be full length or other than fell length, if the derivative or 
analog contains a modified nucleic acid or amino acid, as described below. Derivatives or 
analogs of the nucleic acids or proteins of the invention include, but are not limited to, molecules 
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comprising regions that are substantially homologous to the nucleic acids or proteins of the 
invention, in various embodiments, by at least about 70%, 80% s or 95% identity (with a 
preferred identity of 80-95%) over a nucleic acid or amino acid sequence of identical size or 
when compared to an aligned sequence in which fee alignment is done by a computer homology 
5 program known in ihe ail, or whose eiiunliiig nucleic acid is capable of hybridizing to tibe 
complement of a sequence encoding the aforementioned proteins under stringent moderately 
stringent, or low stringent conditions. See e.g. Ausubel, et al, CURRENT PROTOCOLS in 
MOLECULAR BIOLOGY, John Wiley & Sons, New York, NY, 1993, and below. 

A '"homologous nucleic acid sequence" or "homologous amino acid sequence," or 

10 variations thereof, refer to sequences characterized by a homology at the nucleotide level or 

annuo acid level as discussed above. Homologous nucleotide sequences encode those sequences 
coding for isoforms of MOLX polypeptides. 1 so forms can be expressed in different tissues of 
the same organism as a result o£ for example, alternative splicing of RNA. Alternatively, 
isoforms can be encoded by different genes. In the invention, homologous nucleotide sequences 

1 5 include nucleotide sequences encoding for an MOLX polypeptide of species other than humans, 
including, but not limited to: vertebrates, and thus can include, e.g. , frog, mouse, rat, rabbit, dog, 
cat cow, horse, and other organisms. Homologous nucleotide sequences also include, but are not 
limited to, naturally occurring allelic variations and mutations of the nucleotide sequences set 
forth herein. A homologous nucleotide sequence does not, however, include the exact nucleotide 

20 sequence encoding human MOLX protein. Homologous nucleic acid sequences include those 
nucleic acid sequences that encode conservative amino acid substitutions (see below) in SEQ ID 
NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29, as well as a polypeptide possessing 
MOLX biological activity. Various biological activities of the MOLX proteins are described 
below. 

25 An MOLX polypeptide is encoded by the open reading frame COKF*) of an MOLX 

nucleic acid. An ORF corresponds to a nucleotide sequence that could potentially be translated 
into a polypeptide. A stretch of nucleic acids comprising an ORF is uninterrupted by a stop 
codon. An ORF that represents the coding sequence for a full protein begins with an ATG 
"start" codon and terminates with one of the three "stop" codons, namely, TAA, TAG, or TGA. 

30 For the purposes of this invention, an ORF may be any part of a coding sequence, with or 

without a start codon, a stop codon, or both. For an ORF to be considered as a good candidate 
for coding for a bona fide cellular protein, a minimum size requirement is often set, e.g., a stretch 
of DNA that would encode a protein of 50 amino acids or more. 
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The nucleotide sequences detennined from the cloning of the human MOLX genes 
allows for the generation of probes and primers designed for use in identifying and/or cloning 
MOLX homologues in other cell types, e.g. from other tissues, as well as MOLX homologues 
from other vertebrates. The probe/primer typically comprises substantially purified 
5 oligonucleotide. The oligonucleotide typically comprises a region of nucleotide sequence dial 
hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150, 200, 250, 300, 350 ot 
400 consecutive sense strand nucleotide sequence of SEQ IDNOS:l, 3, 5, 7, 9 } 1 1, 13, 15, 17, 
19, 21, 23, 25, 27, and 29; or an anti-sense strand nucleotide sequence of SEQ ID NOS:l, 3, 5, 7, 
9,11, J 3, 15, 17,19,21,23,25, 27, and 29; or of a naturally occurring mutant of SEQ ID 

10 NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29. 

Probes based on the human MOLX nucleotide sequences can be used to detect transcripts 
or genomic sequences encoding the same or homologous proteins. In various embodiments, the 
probe further comprises a label group attached thereto, e.g. the label group can be a radioisotope, 
a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can be used as a part 

15 of a diagnostic test kit for identifying ceDs or tissues which mis-express an MOLX protein, such 
as by measuring a level of an MOLX-encoding nucleic add in a sample of cells from a subject 
e.g. , detecting MOLX mRNA levels or determining whether a genomic MOLX gene has been 
mutated or deleted. 

"A polypeptide having a biologically-active portion of an MOLX polypeptide" refers to 
20 polypeptides exhibiting activity similar, but not necessarily identical to, an activity of a 

polypeptide of the invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. A nucleic acid fragment encoding a "biologically- 
active portion of MOLX" can be prepared by isolating a portion SEQ ID NOS:l, 3, 5, 7, 9, 11, 
13, 15, ) 7, 19, 21, 23, 25, 27, and 29 (hat encodes a polypeptide having an MOLX biological 
25 activity (the biological activities of the MOLX proteins are described below), expressing the 
encoded portion of MOLX protein {e.g. , by recombinant expression in vitro) and assessing the 
activity of the encoded portion of MOLX. 

MOLX Nucleic Acid and Polypeptide Variants 

The invention further encompasses nucleic acid molecules that differ from the nucleotide 
30 sequences shown SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 ? 15, 17, 19, 21, 23, 25, 27, and 29 due to 

degeneracy of die genetic code and thus encode the same MOLX proteins as that encoded by the 
nucleotide sequences shown in SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 
29. Li another embodiment, an isolated nucleic acid molecule of the invention has a nucleotide 
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sequence encodiag a protein having an amino acid sequence shown in SHQ ID NOS:2, 4, 6, 8, 
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30. 

In addition to the human MOLX nucleotide sequences shown in SEQ ID NOS: 1, 3, 5, 7, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29 it will be appreciated by those stalled in the art that 
5 DMA sequence polymorphisms that lead to changes in the amino acid sequences of the MOLX 
polypeptides may exist within a population (eg., the human population). Such genetic 
polymorphism in the MOLX genes may exist among individuals within a population due to 
natural allelic variation. As used herein, the terms "gene" and "recombinant gene" refer to 
nucleic acid molecules comprising an open reading frame (ORF) encoding an MOLX protein, 

10 preferably a vertebrate MOLX protein. Such natural allelic variations can typically result in 
1-5% variance in the nucleotide sequence of the MOLX genes. Any and all such nucleotide 
variations and resulting amino acid polyrnorphisms in the MOLX polypeptides, which arc the 
result of natural allelic variation and that do not alter the functional activity of the MOLX 
polypeptides, are intended to be within the scope of the invention. 

1 5 Moreover, nucleic acid molecules encoding MOLX proteins from other species, and thus 

that have a nucleotide sequence thai differs from the human sequence SEQ ID NOS:l, 3, 5, 7, 9, 
11, 13, 15, 17 5 19. 21, 23, 25, 27, and 29 are intended to be within the scape of the invention. 
Nucleic acid molecules corresponding to natural allelic variants and homologues of the MOLX 
cDNAs of the invention can be isolated based on their homology to the human MOLX nucleic 

20 acids disclosed herein using the human cDNAs, or a portion thereof, as a hybridization probe 
according to standard hybridization techniques under stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the invention 
is at least 6 nucleotides in length and hybridizes under stringent conditions to the nucleic acid 
molecule comprising the nucleotide sequence of SEQ IDNOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 

25 21, 23, 25, 27. and 29. In another embodiment, the nucleic acid is at least 10, 25, 50, 100, 250, 
500, 750, 1000, 1500, or 2000 or more nucleotides in length. In yet another embodiment, an 
isolated nucleic acid molecule of the invention hybridizes to the coding region. As used herein, 
the term "hybridizes under stringent conditions" is intended to describe conditions for 
hybridization and washing under which nucleotide sequences at least 60% homologous to each 

30 other typically remain hybridized to each other. 

Ilomologs nucleic acids encoding MOLX proteins derived from species other than 
human) or other related sequences (e.g., paralogs) can be obtained by low, moderate or high 
stringency hybridi zation with all or a portion of the particular human sequence as a probe using 
methods well known in the ait for nucleic acid hybridization and cloning. 
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As used herein, the phrase "stringent hybridization conditions" refers to conditions under 
which a probe, primer or oligonucleotide will hybridize to its target sequence, but to no other 
sequences. Stringent conditions arc sequence -dependent and will be different in different 
circumstances. Txmgcr sequences hybridize specifically at higher temperatures than shorter 
5 sequences. Generally, stringent conditions are selected to be about 5 C C lower than the thermal 
melting point (Tm) for the specific sequence at a defined ionic strength and pK The Tm is the 
temperature (under defined ionic strength, pll and nucleic acid concentration) at which 50% of 
the probes complementary to the target sequence hybridize to the target sequence at equilibrium. 
Since the target sequences are generally present at excess, at Tm, 50% of the probes are occupied 

1 0 at equilibrium. Typically* stringent conditions will be those in which the salt concentration is 
less than about 1 .0 M sodium ion, typically about 0.01 to 1 .0 M sodium ion (or other salts) at 
pH 7.0 to 83 aud the temperature is at least about 30°C for short probes, primers or 
oligonucleotides (e.g. , 10 nt to 50 nt) and at least about 60°C for longer probes, primers and 
oligonucleotides. Stringent conditions may also be achieved with the addition of destabilizing 

1 5 agents, such as fonnamide. 

Stringent conditions are known to those skilled in the art and can be found in Ausubel, et 
at., (eds.), CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, N.Y. (1989), 
6.3.1-63.6. Preferably, the conditions are such that sequences at least about 65%, 70%, 75%, 
85%, 90%, 95%, 98%, or 99% homologous to each other typically remain hybridized to each 

20 other. A non-limiting example of stringent hybridization conditions are hybridization in a high 
salt buffer comprising 6X SSC S 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% FYT, 0.02% 
Fieoll, 0.02% BSA, and 500 mg/ml denatured salmon sperm DNA at 65°C, followed by one or 
more washes in 0.2X SSC, 0.01% BSA at 50°C. An isolated nucleic acid molecule of the 
invention that hybridizes under stringent conditions to the sequences of SEQ ID NOS: 1 , 3, 5, 7, 

25 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29 corresponds to a naturally-occuaing nucleic acid 
molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA or 
DNA molecule having a nucleotide sequence that occurs in nature (e.g., encodes a natural 
prfatein). 

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic acid 
30 molecule comprising the nucleotide sequence of SEQ IDNOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 
21, 23, 25, 27, and 29 or fragments, analogs or derivatives thereof, under conditions of moderate 
stringency is provided. A non-limiting example of moderate stringency hybridization conditions 
aTe hybridization in 6X SSC, 5X Denbardt's solution, 0.5% SDS and 100 mg/ml denatured 
salmon sperm DNA at 55°C, followed by one or more washes hi IX SSC, 0. 1% SDS at 37 3 C. 
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Other conditions of moderate stringency that may be used are well-known within the art. See, 
e.g. f AusubcU ct al (eds.), 1993, CURRENT PROTOCOLS IN MOLECULAR Biology, John Wiley & 
Sons, NY, and Kriegler, 1 990; GENE TRANSFER AND EXPRESSION, A LABORATORY MANUAL, 
Stockton Press, NY. 

5 In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid molecule 

comprising the nucleotide sequences of SEQ ID NOS:l, 3, 5, ?, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 
27, and 29 or fragments, analogs or derivatives thereof, under conditions of low stringency, is 
provided. A non-limiting example of low stringency hybridization conditions axe hybridization 
in 35% tbrmamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% 

10 Ficoll, 0.2% BSA, 300 mg/ml denatured salmon sperm DNA, 10% (wt/vol) dextran sulfate at 
40 C C, followed by one or more washes in 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 
0. 1 % SDS at 50°C. Other conditions of low stringency that may be used are well known in the 
art as employed for cross-species hybridizations). See, e.g tJ Ausubel, et al (eds.), 1993, 
Current Protocols in molecular Biology, John Wiley & Sons. NY, and Kriegler, 1990, 

1 5 Gene Transfer and Expression, A Laboratory Manual, Stockton Press, NY; iShilo and 
Weinberg, 1981. Proc Natl Acad Sci USA 78: 6789-6792. 

Conservative Mutations 

, . In addition to naturally-occurring allelic variants of MOLX sequences that may exist in 

20 the population, the skilled artisan will further appreciate that changes can be introduced by 

mutation into the nucleotide sequences of SEQ ID NOS:l, 3, 5, 7,9, 11, 13, 15, 17, 19,21,23, 
25, 27, and 29 thereby leading to changes in the amino acid sequences of the encoded MOLX 
proteins, without altering the functional ability of said MOLX proteins. For example, nucleotide 
substitutions leading to amino acid substitutions at "non-essential" amino acid residues can he 
25 made in the sequence of SEQ ID NOS:2, 4, 6 ; 8, 10, 12, 14, 16, 3 8, 20, 22, 24, 26, 28, and 30. A 
n non-essentiar amino acid residue is a residue that can be altered from the wild-type sequences 
of the MOLX proteins without altering their biological activity, whereas an "essential" amino 
acid residue is required for such biological activity. For example, amino acid residues that are 
conserved among the MOLX proteins of the invention are predicted to be particularly non- 
30 amenable to alteration. Amino acids for which conservative substitutions can he made are well- 
known within the art 

Another aspect of the invention pertains to nucleic acid molecules encoding MOLX 
proteins that contain changes in amino acid residues that are not essential for activity. Such 
MOLX proteins differ in amino acid sequence from SEQ ID NOS:2. 4, 6, 8, 10, 12, 14, 1 6, 18, 



92 



WO 111/81578 



PCT/l S01/1J578 



20, 22, 24, 26, 28, and 30 yet retain biological activity. In one embodiment, the isolated nucleic 
acid molecule comprises a nucleotide sequence encoding a protein, wherein the protein 
comprises an ammo acid sequence at least about 45% homologous to the amino acid sequences 
nf SHQ TDNOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30. Preferably, the protein 
5 encoded by the nucleic acid molecule is at least about 60% homologous to SEQ ID NOS:2, 4, 6, 
8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; more preferably at least about ?0% homologous 
to SEQ ID N0S2, 4, 6, 8, 10, 12, 14, 16, 1 8, 20, 22, 24, 26, 28, and 30; still more preferably at 
least about 80% homologous to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
and 30 ; even more preferably at least about 90% homologous to SHQ 1 D NOS:2, 4, 6, 8, 1 0, 12, 

10 14, 1 6, 1 8, 20, 22 2 24, 26, 28, and 30; and most preferably at least about 95% homologous lo 
SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30. 

An isolated nucleic acid molecule encoding an MOLX protein homologous to the protein 
of SEQ ID NOS:2, 4, 6, 8 5 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30 can be created by 
introducing one or more nucleotide substitutions, additions or deletions into the nucleotide 

15 sequence of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29 such that one or 
more amino acid substitutions, additions or deletions are introduced into the encoded protein. 

Mutations can be introduced into SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28, and 30 by standard techniques, such as site-directed mutagenesis and PCR-mediated 
i mutagenesis. Preferably, conservative amino acid substitutions are made at one or more 

20 predicted, non-essential amino acid residues. A "conservative amino acid substitution" is one in 
which the amino acid residue is replaced with an amnio acid residue having a similar side chain. 
Families of amino acid residues having similar side chains have been defined within the art 
These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic 
side chains (e.g. 9 aspartic acid, glutamic acid), uncharged polar side chains glycine, 

25 asparagine., glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, 
valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side 
chains {e.g, 9 threonine, valine, isoleucine) and aromatic side chains tyrosine, phenylalanine, 
tryptophan, histidine). Thus, a predicted non-essential amino acid residue in the MOLX protein 
is replaced with another amino acid residue from the same side chain family. Alternatively, in 

30 another embodiment, mutations can be introduced randomly along all or part of an MOLX 

coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened 
for MOLX biological activity to identify mutants that retain activity. Following mutagenesis of 
SEQ ID NOS: 1, 3, 5, 7, 9, 1 1 , 1 3, 1 5, 1 7, 1 9, 21 , 23, 25, 27, and 29, the encoded protein can be 
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expressed by any recombinant technology known in the art and the activity of the protein can be 
determined 

The rclatedness of amino acid families may also be determined based on side chain 
interactions. Substituted amino acids may be fully conserved "strong" residues or fully 
5 conserved "weak" residues. The "strong" group of conserved amino acid residues may be any 
one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, M1LF, HY, PYW, 
wherein the single letter amino acid codes are grouped by those amino acids that may be 
substituted for each other. Likewise, the "weak" group of conserved residues may be any one of 
the following: CSA 3 ATV, SAG, STNX, STPA, SGND, SKDEQK, NDEQHK, NEQHRK, 
10 VLIM, HFY, wherein (he letters within each group represenl the single letter amino acid code. 

In one embodiment, a mutant MOLX protein can be assayed for (!) the ability to form 
pioteiniprotein interactions with other MOLX proteins, other cell-surface proteins, or 
biologically-active portions thereof (fir) complex formation between a mutant MOLX protein 
and an MOLX ligand; or (iff) the ability of a mutant MOLX protein to bind to an intracellular 
15 target protein or biologically-active portion thereof; avidin proteins). 

In yet another embodiment, a mutant MOLX protein can be assayed for the ability to 
regulate a specific biological function (e.g v regulation of insulin release). 

Antisense Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules thai 
20 are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NOS:l, 3, 5 S 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or fragments, 
analogs or derivatives thereof. An tf antisense" nucleic acid comprises a nucleotide sequence that 
is complementary to a "sense" nucleic acid encoding a protein (e.g. , complementary to the 
coding strand of a double-stranded cDNA molecule or complementary to an mRM A sequence). 
25 In specific aspects, antisense nucleic acid molecules are provided that comprise a sequence 
complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire MOLX 
coding strand, or to only a portion thereof. Nucleic acid molecules encoding fragments, 
homologs, derivatives and analogs of an MOLX protein of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 
1 6, 1 8, 20, 22. 24, 26, 28, and 30, or antisense nucleic acids complementary to an MOLX nucleic 
30 acid sequence of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29, are 
additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
of the coding strand of a nucleotide sequence encoding an MOLX protein. The term "coding 
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region" refers to the region of the nucleotide sequence comprising codons which are translated 
into amino acid residues. In another embodiment, the antisensc nucleic acid molecule is 
antisensc to a "noncoding region" of the coding strand of a nucleotide sequence encoding the 
MOLX protein. The term "noncoding region" refers to 5' and 3' sequences which flank the 
5 Lading region thai aie not translated into amino acids {i.e., also referred to as 5* and 3' 
untranslated regions). 

Given the coding strand sequences encoding the MOLX protein disclosed herein, 
antisensc nucleic acids of the invention can be designed according to the rules of Watson and 
Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be complementary to 

10 the entire coding region of MOLX mRNA, but more preferably is an oligonucleotide that is 
antisense to only a portion of the coding or noncoding region of MOLX rnRNA. For example, 
the antisense oligonucleotide can be complementary to the region surrounding the translation 
start site of MOLX mKN A. An antisense oligonucleotide can he, for example, about 5, 1 0, 1 5, 
20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can 

15 be constructed using chemical synthesis or enzyniatic ligation reactions using procedures known 
in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be 
chemically synthesized using naturally-occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical stability 
of the duplex formed between the antisense and sense nucleic acids (e.g. , phosphorothioate 

20 derivatives and acridine substituted nucleotides can be used). 

Examples of modified nucleotides that can be used to generate the antisense nucleic acid 
include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminoinethyl- 
2-thiouridine, S^aifoxymethylaminomeihyluiaeil, dihydiouracil, beta-D-galactosylqueosine, 

25 inosine, N6-isopentenyladenine. l-methylguanine, 1-mefhylinosine, 2,2-dimethylguanine, 
2«methyladenine s 2-methylguanine, 3-methylcytosine, 5-methylcytosinc 3 N6-adenme, 
7-methylgiianine, 5-methylaminomcthyluracil, 5-methoxyaminomethyi-2-thiouracil, 
beta-D-manno sy Iqueosine, 5 , -methoxycarboxymethylurdcil > 5-niethoxyuraril, 

2- methylttao-N6-isopentenyladenine> uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
30 queosine. 2-fhiocytosiiie, S-methyl^-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 

uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (y), 5-mcthyl-2-thiouradl, 

3- (3-amino-3-N-2-carboxypropyl) uracil. (acp3)w, and 2,6-diammopurine. Alternatively, the 
antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (Le. f RNA transcribed from the 
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inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
described further in the following subsoction). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mKNA and'or 
5 genomic DNA encoding an MOLX protein to theieby inhibit expression of the protein feg. , by 
inhibiting transcription and/or translation). The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic acid 
molecule that binds to DNA duplexes, through specific interactions in the major groove of the 
double helix. An example of a route of adxninislration of antisense nucleic acid molecules of the 

10 invention includes direct Lnjection at a tissue site. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and then administered systeroically. For 
example, foT systemic administration, antisense molecules can be modified such that they 
specifically bind to receptors or antigens expressed on a selected cell surface (eg., by linking the 
antisense nucleic acid molecules to peptides or antibodies that bind to cell surface receptors or 

1 5 antigens). The antisense nucleic acid molecules can also be delivered to cells using the vectors 
described herein. To achieve sufficient nucleic acid molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the control of a strong pol II or pol HI promoter 
are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
20 a-anomeric nuckic acid molecule. An a-anomeric nucleic acid molecule forms specific 

double-stranded hybrids with complementary RNA in which* contrary to the usual P-units, the 
strands run parallel to each other. See, e.g., Gaultier, et al., 1 987. Nucl Acids Res, 15: 
6625-6641. Th6 antisense nucleic acid molecule can also comprise a 2 f -o-methylrilx>micleotide 
(see, e.g., hione, et a/. 1987. Nucl. Acids Res. 15: 6131-6148) or a chimeric RNA-DNA analogue 
25 (see, e.g. t Inoue, et a/., 1987. FEES Lett 215: 327-330. 

Ribozymes and PNA Moieties 

Nucleic acid modifications include, by way of non-limiting example, modified bases, and 
nucleic acids whose sugar phosphate backbones are modified or derivatized. These 
30 modifications are carried out at least m part to enhance the chemical stability of the modified 
nucleic acid, such that they may be used, for example, as antisense binding nucleic acids in 
therapeutic applications in a subject 

In one embodiment, an antisense nucleic acid of the invention is a ribozyme. Riboaymes 
are catalytic RNA molecules withribonuclease activity that are capable of cleaving a 
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single-stranded nucleic acid, such as an niRNA, to which they have a complementary region. 
Thus, ribozymes (e.g., hammerhead ribozymes as described in HasclhofF and Gerlach 1988. 
Nature 334: 585-591) can be used to catalytically cleave MOLX mRNA transcripts to thereby 
inhibit translation of MOLX mRNA. A ribozyme having specificity for an MOLX-encoding 
5 nucleic acid can be designed based upon the nucltf ulide sequence of an MOLX cDNA disclosed 
herein (i.e., SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29). For example, a 
derivative of a Tetrahymena L-19 TVS RNA can be constructed in which the nucleotide sequence 
of the active site is complementary to the nucleotide sequence to be cleaved in an 
MOLX-encodingmRNA. See, e.g., U.S. Patent 4,987,071 to Cech, et al. and US. Patent 

10 5,1 16,742 to Cech, et al MOLX mRNA can also be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Battel etal., (1993) 
Science 261:1411-1418. 

Alternatively, MOLX gene expression can be inhibited by targeting nucleotide sequences 
complementary to the Tegulatory region of the MOLX nucleic acid the MOLX promoter 

1 5 and/or enhancers) to form triple helical structures that prevent transcription of the MOLX gene in 
target cells. See, e.g., Helene, 1991. Anticancer Drug Des. 6: 569-84; Ilelene, etal. 1992. Ann. 
N.Y. Acad. Set. 660: 27-36; Maher, 1992. Bioassays 14: 807-15. 

In various embodiments, the MOLX nucleic acids can be modified at the base moiety, 
sugar moiety or phosphate; backbone to improve, eg. , the stability, hybridization, or solubility of 

20 the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids can be 
modified to generate peptide nucleic acids. See, e.g., Hyrup, et ah, 1 996. Bioorg Med Chem 4: 
5-23. As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics 
(e.g. , DNA mimics) in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 

25 backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup, et al., 1996. supra, 
Perry-O'Keefe, etal., 1996. Proa Natl Acad Sci. USA 93: 14670-14675, 

PNAs of MOLX can be used in therapeutic and diagnostic applications. For example, 

30 PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 

expression by, eg., inducing transcription or translation arrest or inhibiting replication. PNAs of 
MOLX can also be used, for example, in the analysis of single base pair mutations in a gene 
(e.g. , PNA directed PCR clamping; as artificial restriction enzymes when used in combination 
with other enzymes, e.g., Si nucleases (see, Hyrup, et aL, \996.supra); or as probes or primers 
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for DMA sequence and hybridization (see, Hyrup, et aL, 1996, supra; Perry-O'Keefe, ei aL, 
1996. supra). 

In another embodiment, PNAs of MOLX can be modified, eg, to enhance their stability 
or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the foimation of 
5 PNA-DNA chimeras, or by (lie use of liposomes or oilier techniques of drug delivejy known in 
the art. For example, PNA-DNA chimeras of MOLX can be generated that may combine the 
advantageous properties of PNA and DNA. Such chimeras allow DNA recognition enzymes 
(eg. , RNase H and DNA polymerases) to interact with the DNA portion while the PNA portion 
would provide high binding affinity and specificity. PNA-DNA chimeras can be linked using 

10 linkers of appropriate lengths selected in terms of base slacking, number of bonds between the 
nucleobases, and orientation {see, Hyrup, et aL, 1996. supra). The synthesis of PNA-DNA 
chimeras can be performed as described in Ilyrup, ei aL, 1996. supm and Finn, et al. 9 1996. Nucl 
Acids Res 24: 3357-3363. For example, a DNA chain can be synthesized on a solid support 
using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g„ 

1 5 S^-methoxytritylJamino-S^eoxy-thytnidine phosphoramidite, can he used between the PNA 
and the 5' end of DNA. See, e.g., Mag, et aL, 1989. Nucl Acid Res 17: 5973-5988. PNA 
monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 5" PNA 
segment and a 3* DNA segment. See, e.g., Finn, et aL, 1996. supra. Alternatively, chimeric 
molecules can be synthesized with a 5 f DNA segment and a 3' PNA segment See, e.g., Petersen, 

20 etaL, 1975. Bioorg Med. Chem. Lett 5: 1119-11124. 

Li other embodiments, the oligonucleotide may include other appended groups such as 
peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the 
cell membrane (see, eg, Letsinger, etaL, 1989. Proc. Natl. Acad Set U.SA. 86: 6553-6556; 
Lernaitre, et aL, 1987. Prac. Nail. Acad ScL 84: 64S-652; PCT Publication No. WO88/09810) or 

25 the blood-brain barrier (see, eg, PCT Publication No. WO 89/10134). ^addition, 

oligonucleotides can be modified with hybridization triggered cleavage agents (see, eg., Kial,et 
al. f 1988. BioTechmques 6:958-976) or intercalating agents (see, eg, 2cm, 1988. Pharm, Jfca?. 5: 
539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, a hybridization triggered cross-linking agent a transport agent a hybridization-triggered 

30 cleavage agent, and the like. 

MOLX Polypeptides 

A polypeptide according to the invention includes a polypeptide including the ammo acid 
sequence of MOLX polypeptides whose sequences are provided in SEQ ID NOS:2, 4, 6, 8, 10, 
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12, 14, 16, 18, 20. 22, 24, 26, 25, and 30. The invention also includes a mutant or variant protein 
any of whose residues may be changed from the corresponding residues shown in SEQ ID 
NOS:2, 4, 6, S, 10, 12, 14, 1 6, 1 8, 20, 22, 24, 26, 28, and 30 while still encoding a protein that 
maintains its MOLX activities and physiological functions, or a functional fragment thereof. 
5 In general, an MOLX variant that preserves MOLX-like function includes any variant in 

which residues at a particular position in the sequence have been substituted by other amino 
acids* and further include the possibility of inserting an additional Ten due or residues hetween 
two residues of the parent protein as well as the possibiHty of deleting one or niori residues fr^ 
the parent sequence. Any amino acid substitution, insertion, or deletion is encompassed by the 
10 invention. In favorable circumstances, the substitution is a conservative substitution as defined 
above. 

One aspect of the invention pertains to isolated MOLX proteins, and biologically-active 
portions thereof, or derivatives, fragments, analogs or homologs thcruot Also provided are 
polypeptide fragments suitable for use as immunogens to raise anti-MOLX antibodies. In one 

1 5 embodiment, native MOLX proteins can be isolated from cells or tissue sources by an 

appropriate purification scheme using standard protein purification techniques. In another 
embodiment, MOLX proteins are produced by recombinant DNA techniques. Alternative to 
recombinant expression, an MOLX protein or polypeptide can be synthesized chemically using 
standard peptide synthesis techniques. 

20 An '"isolated" or "purified" polypeptide oi protein or biologically-active portion thereof is 

substantially free of cellular material or other contaminating proteins from the cell or tissue 
source from which the MOLX protein is derived, or substantially free from chemical precursors 
or other chemicals when chemically synthesized. The language "substantially free of cellular 
material" includes preparations of MOLX proteins in which the protein is separated from cellular 

25 components of the cells from which it is isolated or recombinanily-produced. In one 

embodiment, the language "substantially free of cellular material" includes preparations of 
MOLX proteins having less than about 30% (by dry weight) of non-MOLX proteins (also 
referred to herein as a "contaminating protein"), more preferably less than about 20% of 
non-MOLX proteins, still more preferably less than about 10% of non-MOLX proteins : and most 

30 preferably less than about 5% of non-MOLX proteins. When the MOLX protein or biologically- 
active portion thereof is recombmantly-produccd, it is also preferably substantially free of 
culture medium, i.e., culture medium represents less than about 20%, more preferably less than 
about 10%, and most preferably less than about 5% of the volume of the MOLX protein 
preparation. 
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The language "substantially free of chemical precursors or other chemicals" includes 
preparations of MOLX proteins in which the protein is separated from chemical precursors or 
other chemicals that are involved in the synthesis of the protein. In one embodiment, the 
language "substantially free of chemical precursors or other chemicals' 1 includes preparations of 
5 MOLX proteins having less than about 30% (by dry weight) of chemical precursors or 

non-MOLX chemicals, more preferably less than about 20% chemical precursors or non-MOLX 
chemicals, still more preferably less than about 10% chemical precursors or non-MOLX 
chemicals, and most preferably less than about 5% chemical precursors ox non-MOLX 
chemicals. 

1 0 Biologically-active portions of MOLX proteins include peptides comprising amino acid 

sequences sufficiently homologous to or derived from the amino acid sequences of the MOLX 
proteins (e,g. 9 the amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, and 30) that include fewer amino acids than the fiill-length MOLX proteins, and 
exhibit at least one activity of an MOLX protein. Typically, biologically-active portions 

15 comprise a domain or motif with at leasL one activity of the MOLX protein. A biologically- 
active portion of an MOLX protein can be a polypeptide which is, for example, 10, 25, 50, 100 
or more amino acid residues in length. 

Moreover, other biologically-active portions, in which other regions of the protein are 
deleted, con be prepared by recombinant techniques and evaluated for one or more of the 

20 functional activities of a native MOLX protein. 

In an embodiment the MOLX protein has an amino acid sequence shown in SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30. In other embodiments, the MOLX 
protein is substantially homologous to SEQ H>NOS:2, 4, 6, 8, 10, 12, 14, 16,18, 20,22,24,26, 
28, and 30, and retains the functional activity of the protein of SEQ ED NOS:2, 4, 6, 8, 10, 12, 

25 14, 16, 18. 20, 22, 24, 26, 28, and 30, yet differs in amino acid sequence due to natural allelic 

variation or mutagenesis, as described in detail, below. Accordingly, in another embodiment, the 
MOLX protein is a protein that comprises an amino acid sequence at least about 45% 
homologous to the amino acid sequence SEQ ID NOS:2, 4, 6, 8. 10, 12, 14, 16> 18, 20, 22. 24, 
26, 28, and 30, and retains the fractional activity of the MOLX proteins of SEQ ID NOS:2, 4, 6, 

30 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30. 

Determining Homology Between Two or More Sequences 

To determine the percent homology of two amino acid sequences or of two nucleic acids, 
the sequences are aligned for optimal comparison purposes (e.g.» gaps can be introduced in the 
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sequence of a first amino acid or nucleic acid sequence for optima! alignment with a second 
amino or nucleic acid sequence). Hie amino acid residues or nucleotides at corresponding amino 
acid positions or nucleotide positions are then compared. When a position in the first sequence 
is occupied by the same amino acid residue or nucleotide as the corresponding position in the 
5 second sequence, then the molecules are homologous at that position (i. e. , as used herein amino 
acid or nucleic acid "homology 11 is equivalent to amino acid or nucleic acid "identity"). 

The nucleic acid sequence homology may be determined as the degree of identity 
between two sequences- This homology may be determined using computer programs known in 
the art, such as GAP software provided in the GCG program package. See, Needleman and 

1 0 Wunsch, 1970. JMol Biol 48: 443453. Using GCG GAP software with the following settings 
for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP extension penalty 
of 0.3, the coding region of the analogous nucleic acid sequences referred to above exhibits a 
degree of identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with the 
CDS (encoding) part of the DNA sequence shown in SKQ ID NOS:l, 3, 5, 7, 9, 11 , 13, 15, 17, 

15 19,21, 23,25, 27, and 29. 

The term "sequence identity" refers to the degree to which two polynucleotide or 
polypeptide sequences are identical on a residue-by-residue basis over a particular region of 
comparison. The term percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over lhat region of comparison, detemiinmg the number of positions 

20 at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of nucleic acids) 
occurs in both sequences to yield the number of matched positions, dividing the number of 
matched positions by the total number of positions in the region of comparison (ie., the window 
size), and multiplying the result by 100 to yield the percentage of sequence identity. The term 
"substantial identity" as used herein denotes a characteristic of a polynucleotide sequence, 

25 wherein the polynucleotide comprises a sequence that has at least 80 percent sequence identity, 
preferably at least 85 percent identity and often 90 to 95 percent sequence identity, more usually 
at least 99 percent sequence identity as compared to a reference sequence over a comparison 
region. 

30 Chimeric and Fusion Proteins 

The invention also provides MOLX chimeric or fusion proteins. As used herein, an 
MOLX "chimeric protein" or "fusion protein" comprises an MOLX polypeptide operatrvefy- 
linkcd to a non-MOLX polypeptide. An "MOLX polypeptide" refers to a polypeptide having an 
amino acid sequence corresponding to an MOLX protein (SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
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18, 20, 22, 24, 26, 28, and 30), whereas a n non-MOLX polypeptide" refers to a polypeptide 
having an amino acid sequence con-esponding to a protein that is not substantially homologous 
to the MOLX protein, eg. , a protein that is different from die MOLX protein and that is derived 
from the same or a different organism. Within an MOLX fusion protein the MOLX polypeptide 
5 ean correspond to all or a portion of an MOLX protein. In one embodiment, an MOLX fusion 
protein comprises at least one biologically-active portion oFan MOLX protein. In another 
embodiment an MOLX fusion protein comprises at least two biologically-active portions of an 
MOLX protein. In yet another embodiment, an MOLX fusion protein comprises at least three 
biologically-active portions of an MOLX protein. Within the fusion protein, the term 

10 "operatively-Enked" is intended to indicate that the MOLX polypeptide and the non-MOLX 
polypeptide are fused in-frame with one another. The non-MOLX polypeptide can be fused to 
the N-tenninus or C-teaninus of the MOLX polypeptide. 

In one embodiment, the fusion protein is a GST-MOLX fusion protein in which the 
MOLX sequences are fused to the C-termmus of die GST (glutathione S-transferase) sequences. 

1 5 Such fusion proteins can facilitate the purification of recombinant MOLX polypeptides. 

In another embodiment, the fusion protein is an MOLX protein containing a heterologous 
signal sequence at its N-terminus. In certain host cells (e.g., mammalian host cells), expression 
and/or secretion of MOLX can be increased through use of a heterologous signal sequence. 
In yet another embodiment, the fusion protein is an MOLX-mimunoglobulin fusion 

20 protein in which the MOLX sequences are fused to sequences derived from a member of the 
immunoglobulin protein family. The MOLX-immunoglobulin fusion proteins of the invention 
can be incorporated into pharmaceutical compositions and administered to a subject to inhibit an 
interaction between an MOLX ligand and an MOLX protein on the surface of a cell, to thereby 
. suppress MOLX-mediated signal transduction in vivo. The MOLX-nrnrjunoglobulin fusion 

25 proteins can be used to affect the bioavailability of an MOLX cognate ligand. Inhibition of the 
MOLX ligand/MOLX interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, as well as modulating (e,g. promoting or inhibiting) 
cell survival. Moreover, the MOLX-immunoglobulin fusion proteins of the invention can be 
used as immunogens to produce anti-MOLX antibodies in a subject, to purity MOLX Jig&nds> 

30 and in screening assays to identify molecules that inhibit the interaction of MOLX with an 
MOLX ligand. 

An MOLX chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional techniques, 
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e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction enzyme 
digestion to provide for appropriate termini, fiUing-in of cohesive ends as appropriate, alkaline 
phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In another 
embodiment, the fusion gene can be synthesized by conventional techniques including 
5 automated DNA synihesizeis. Alternatively, PCR amplification of gene fragments can be 
carried out using anchor primers that give rise to complementary overhangs between two 
consecutive gene fragments that can subsequently be annealed and reamplified to generate a 
chimeric gene sequence (see, e.g., Ausubel, et al (eds.) CURRENT PROTOCOLS IN MOLECULAR 
Biology, John Wiley & Sons, 1992). Moreover, many expression vectors are commercially 
10 available that already encode a fusion moiety (e.g., a GST polypeptide). An MOLX-encoding 
nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked 
in-ftame to the MOLX protein. 



MOLX Agonists and Antagonists 

15 Hie invention also pertains to variants of the MOLX proteins mat function as either 

MOLX agonists {i.e., mimelics) or as MOLX antagonists. Variants of the MOLX protein can be 
generated by mutagenesis (e,g,, discrete point mutation or truncation of the MOLX protein). An 
agonist of the MOLX protein can retain substantially the same, or a subset of, the biological 
activities of the naturally occurring form of the MOLX protein. An antagonist of the MOLX 

20 protein can inhibit one or more of the activities of the naturally occurring form of the MOLX 
. protein by, for example, competitively binding to a downstream or upstream member of a 
cellular signaling cascade which includes the MOLX protein. Thus, specific biological effects 
can be elicited by treatment with a variant of limited function. In one embodiment, treatment of 
a subject with a variant having a subset of the biological activities of the naturally occurring 

25 form of the protein has fewer side effects m a subject relative to treatment with the naturally 
occurring form of the MOLX proteins. 

Variants of the MOLX proteins that function as either MOLX agonists (i.e. t mimetics) or 
as MOJ J"C antagonists can be identified by screening combinatorial libraries of mutants {cg. t 
truncation mutants) of the MOLX proteins for MOLX protein agonist or antagonist activity. In 

30 one embodiment, a variegated library of MOLX variants is generated by combinatorial 

mutagenesis at the nucleic acid level and is encoded by a variegated gene library, A variegated 
library of MOLX variants can be produced by, for example, enzymatically ligathig a mixture of 
synthetic oligonucleotides into gene sequences such that a degenerate set of potential MOLX 
sequences is expressible as individual polypeptides, or alternatively, as a set of lanjer fusion 
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proteins {e.g. 9 for phage display) containing the set of MOLX sequences therein. There are a 
variety of methods which can be used to produce libraries of potential MOLX variants from a 
degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be 
performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an 
5 appropriate expressiun vector. Use uf a degenerate set of genes allows for the provision, in one 
mixture, of all of the sequences encoding the desired set of potential MOLX sequences. 
Methods for synthesizing degenerate oligonucleotides are well-known within the ait. See, e.g., 
Narang, 1983. Tetrahedron 39: 3; Ttakura, et aL, 1 984. Annu. Rev. Biochem. 53: 323; Italaira, et 
of., 1984. Science 198: 1056; Ike, et al, 1983. Nucl Acids Res. 1 1 : 477. 

10 

Polypeptide libraries 

In addition, libraries of fragments of the MOLX protein coding sequences can be used to 
generate a variegated population of MOLX fragments for screening and subsequent selection of 
variants of an MOLX protein. In one embodiment, a library of coding sequence fragments can 

15 be generated by treating a double stranded PCR fragment of an MOLX coding sequence with a 
nuclease under conditions wherein nicking occurs only about once per molecule, denaturing the 
double stranded DNA, renaturing the DNA to form double-stranded DNA that can include 
sensc/antisense pairs irom different nicked products, removing single stranded portions from 
reformed duplexes by treatment with Si nuclease, and ligating the resulting fragment library into 

20 an expression vector. By this method, expression libraries can be derived which encodes 
N-teiminal and internal fragments of various sizes of the MOLX proteins. 

Various techniques axe known in the art for screening gene products of combinatorial 
libraries made by point mutations or truncation, and for screening cDNA libraries for gene 
products having a selected property. Such techniques are adaptable for rapid screening of the 

25 gene libraries generated by the combinatorial mutagenesis of MOLX proteins. The most widely 
used techniques, which are amenable to high throughput analysis, for screening large gene 
libraries typically include cloning the gene library into Teplicabfe expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing the 
combinatorial genes under conditions in which detection of a desired activity facilitates isolation 

30 of the vector encoding the gene whose product was detected. Recursive ensemble mutagenesis 
(REM), a new technique that enhances the frequency of functional mutants in the libraries, can 
be used in combination with the screening assays to identify MOLX variants. See,, e.g., Arkin 
and Yourvun, 1 992. Five. Natl. Acad. Set. USA 89: 781 1-7815; Delgrave, et al, 1 993. Protein 
Engineering 6:327-331, 
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Anti-MOLX Antibodies 

The invention encompasses antibodies and antibody fragments, such as F a b or {F^bh t fliat 
bind imrnunospeciflcalry to any of the MOLX polypeptides of said invention. 

An isolated MOLX protein, or a portion or fragment thereof, can be used as an 
5 immunogen to generate antibodies that bind to MOLX polypeptides using standard techniques 
for polyclonal and monoclonal antibody preparation. The full-length MOLX proteins can be 
used or, alternatively, the invention provides antigenic peptide fragments of MOLX proteins for 
use as immunogens. The antigenic MOLX peptides comprises at least 4 amino acid residues of 
the amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 10. 12, 14, 16, 18, 20, 22, 24 r 26, 28, 

10 and 30 and encompasses an epitope of MOLX such that an antibody raised against the peptide 
forms a specific immune complex with MOLX. Preferably, the antigenic peptide comprises at 
least 6, 8, 10, 15, 20, or 30 amino acid residues. Longer antigenic peptides are sometimes 
preferable over shorter antigenic peptides, depending on use and according to methods well 
known to someone skilled in the art 

15 In certain embodiments of the invention, at least one epitope encompassed by the 

antigenic peptide is a region of MOLX that is located on the surface of the protehi (e.g., a 
hydrophilic region). As a means for targeting antibody production, hydropathy plots showing 
regions of hydrophilicity and hydrophobicity may be generated by any method well known in the 
art, including, for example, the Kyte Doolittle or the Hopp Woods methods, either with or 

20 without Fourier transformation (see, e.g., Hopp and Woods, 198L Proc. Nat Acad. Set USA 78: 
3824-3828; Kyte and Doolittle, 1982. J. Mot Biol. 157: 105-142, each incorporated herein by 
reference in their entirety). 

As disclosed herein, MOLX protein sequences of SEQ IDNOS:2, 4, 6, 8, 10, 12, 14, 16, 
1 8, 20, 22, 24, 26, 28, and 30, or derivatives, fragments, analogs or homologs thereof, rriay be 

25 utilized as immunogens in the generation of antibodies that immunospecifically-bind these 

protein components. The term "antibody" as used herein refers to immunoglobulin molecules 
and immunologically-active portions of immunoglobulin molecules, molecules that contain 
an antigen binding site that specificalry-binds (immunoreacts with) an antigen, such as MOLX. 
Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, 

30 Fab and fragments, and an F a b expression library. In a specific embodiment, antibodies to 
human MOLX proteins are disclosed. Various procedures known within the art may be used for 
the production of polyclonal or monoclonal antibodies to on MOLX protein sequence of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, or a derivative, Iragment, analog or 
homolog thereof. Some of these proteins are discussed below. 
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For the production of polyclonal antibodies, various suitable host animals (e.g, rabbit, 
goat, mouse or other mammal) may be immunized by injection with the native protein, or a 
synthetic variant thereof, or a derivative of the foregoing. An appropriate immunogenic 
preparation can contain, for example, recombinantly-expressed MOLX protein or a chemically- 
5 synthesized MOLX polypeptide. The preparation can further include an adjuvant. Various 
adjuvants used to increase the immunological response include, but are not limited to, Freund's 
(compete and incomplete), mineral gels (e.g., aluminum hydroxide), surface active substances 
{e.g., lysolecidun, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, etc.), 
human adjuvants such as Bacille Calmette-Guerin and Corynebactetium parvum, or similar 

10 immunostimulatory agents. If desired, the antibody molecules directed against MOLX can be 
isolated from the mammal (e.g. s from the blood) and further purified by well known techniques, 
such as protein A chromatography to obtain the IgG fraction. 

The term ''monoclonal antibody" or "monoclonal antibody composition 11 , as used herein, 
refers to a population of antibody molecules that contain only one species of an antigen binding 

15 site capable of immunoreacting with a particular epitope of MOLX. A monoclonal antibody 

composition thus typically displays a single binding affinity for a particular MOLX protein with 
which it immunoreacts. For preparation of monoclonal antibodies directed towards a particular 
MOLX protein, or derivatives, fragments, analogs or homologs thereof, any technique that 
provides for the production of antibody molecules by continuous cell line culture may be 

20 utilized. Such techniques include, but are not limited to, the hybridoma technique {see, e.g., 
Kohler & Milstein, 1975. Nature 256: 495-497); the trioma technique; the human B-cell 
hybridoma technique (see, eg., Koztoor, et al, 1983. Immunol Today 4: 72) and the EBV 
hybridoma technique to produce human monoclonal antibodies (see, e.g., Cole, et al, 1985. In; 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human 

25 monoclonal antibodies may be utilized in the practice of the invention and may be produced by 
using human hybridomas (see, e.g., Cote, eta!., 1983. Proc Natl Acad Sci USA 80: 2026-2030) 
or by transforming human B-cells with Epstein Barr Virus in vitro (see, e.g., Cole, et al, 1985. 
In: Monoclonal Anttoodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Each of 
the above citations is incorporated herein by reference in their entirety. 

30 According to the invention, techniques can be adapted for the production of single-chain 

antibodies specific to an MOLX protein (see, e.g., U.S. Patent No. 4,946,778). In addition, 
methods can be adapted for the construction of Fob expression libraries (see, e.g., Huse, et al, 
1 989. Science 246: 1275-1281) to allow rapid and effective identification of monoclonal F*b 
fragments with the desired specificity for an MOLX protein or derivatives, fragments, analogs or 
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homo logs thereof. Non-human antibodies can be "humanized" by techniques well known in the 
art. See, e.g., U.S. Patent No. 5,225,539. Antibody fragments that contain the tdiotypes to an 
MOLX protein may be produced by techniques known in flic art including, but not limited to; (r) 
an fragment produced by pepsin digestion of an antibody molecule; (it) an F a b fragment 
5 generated by reducing the disulfide bridges of an Fppp fragment; (Hi) an F^ fragment generated 
by the treatment of the antibody molecule with papain and a reducing agent; and (/v) b\ 
fragments. 

Additionally, recombinant anti-MOLX antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be made 

10 using standard recombinant DNA techniques, are within the scope of the invention. Such 
chimeric and humanized monoclonal antibodies can be produced by recombinant DNA 
techniques known in the art, for example using methods described in International Application 
No. PCT/US86/02269; European Patent Application No. 1 84,1 87; European Patent Application 
No. 171,496; European Patent Application No. 173,494; PCTIhtemationalPubHcationNo. WO 

1 5 86/01533; U.S. Patent No. 4,8.16,567; U.S. Pat No. 5,225,539; European Patent Application No. 
125,023; Better, et aL, 1988. Science 240: 1041-1043; Iiu, et a/., 1987. Proc. Natl Acad. Set. 
USA 84: 3439-3443; Iiu, etaL, 1987.7. Immunol. 139: 3521-3526; Sun, etal, 1987. Proc. Natl 
Acad Set IJSA 84: 214-21 8; Nishmiura, et aL, 1987. Cancer Res. 47: 999-1005; Wood, et a/., 

1 985. Nature 3 14 :446-449; Shaw, et al y 1 988. J. Nati Cancer Inst 80: 1553-1 559); 

20 Morrison(1985) Science 229:1202-1207; Oi, et al (1986) BioTechniques 4:214; Jones, et aL, 

1986. Nature 321: 552-525; Vcihocyan, et ai, 1988. Science 239: 1534; and Beidler, et a!., 
1 988. J. Immunol. 141: 4053-4060. Each of the above citations are incorporated herein by 
reference in their entirety. 

In one embodiment, methods for (lie screening of antibodies that possess the desired 
25 specificity include, but are not limited to ? eiizyme-linked hnmunosorbent assay (ELISA) and 
other immunologically-mediated techniques known within the art. In a specific embodiment, 
selection of antibodies that are specific to a particular domain of an MOLX protein is facilitated 
by generation of hybridomas (hat bind to the fragment of an MOLX protein possessing such a 
domain. Thus, antibodies mat are specific for a desired domain within au MOLX protein, or 
30 derivatives, fragments, analogs or homobgs thereof, are also provided herein. 

Anti-MOLX antibodies may be used in methods known within the art relating to the 
localization and/or quantitation of an MOLX protein (e.g., for use in measuring levels of die 
MOLX protein within appropriate physiological samples, for use in diagnostic methods, for use 
in imaging die protein, and the like). In a given embodiment, antibodies for MOLX proteins, or 
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derivatives, fragments, analogs or horaologs thereof, that contain the antibody derived binding 
domain, are utilized as pharmacologically-active compounds (hereinafter "Therapeutics' 1 ). 

An anti-MOLX antibody (eg., monoclonal antibody) can be used to isolate an MOLX 
polypeptide by standard techniques, such as affinity chromatography or immunoprecipiration, 
5 An anli-MOLX antibody can facilitate the purification of natural MOLX polypeptide from cells 
and of recombinantly-produced MOLX polypeptide expressed in host cells. Moreover, an 
anti-MOLX antibody can be used to detect MOLX protein (e.g., in a cellular lysate or cell 
supernatant) in order to evaluate the abundance and pattern of expression of the MOLX protein . 
Anti-MOLX antibodies can be used diagnostically to mom tor protein levels in tissue as part of a 

10 clinical testing procedure, e.g. 9 to, for example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling (Le. , physically linking) the antibody to a 
detectable substance. Examples of detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, hmrmesccnt materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, 

15 P-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes 

include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanute, rhodamine, dicrdorotriazmyiamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes • 
luminol; examples of bioluminescent materials include luciferase, luciferin, and aequorin, and 

20 examples of suitable radioactive material include 125 I, l31 1, 33 S or 3 H. 

MOLX Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding an MOLX protein, or derivatives, fragments, analogs or 

25 homologs thereof, As used herein, the term n vector" refers to a nucleic acid molecule capable of 
transporting another nucleic acid to which it has been linked. One type of vector is a "plasrnid*, 
which refers to a circular double stranded DNA loop into which additional DNA segments can 
be ligated. Another type of vector is a viral vector, wherein additional DNA segments can be 
ligated into the viral genome. Certain vectors arc capable of autonomous replication in a host 

30 cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication 
and episomal mammalian vectors), Other vectors (e.g., norj-episornal mammalian vectors) are 
integrated into the genome of a host cell upon introduction into the host cell, and thereby are 
replicated along with the host genome. Moreover, certain vectors are capable of directing the 
expression of genes to which they are operativcly-linkecL Such vectors are referred to herein as 
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"expression vectors". In general, expression vectors of utility in recombinant DNA techniques 
are often in the form of plasmids. In the present specification, "plasmid" and 'Vector 1 ' can be 
used interchangeably as the plasmid is the most commonly used form of vector. However, the 
invention is intended to include such other foims of expression vectors, such as viral vectors 
5 (e.g. 9 replication defective retroviruses, adenoviruses and adeno-associatcd viruses), which serve 
equivalent functions . 

The recombinant expression vectors of the invention comprise a nucleic acid of the 
invention in a form suitable for expression of the nucleic acid in a host cell, which means that the 
recombinant expression vectors include one or more regulatory sequences, selected on the basis 

10 of the host cells to be used for expression, that is operatively-linloed to the nucleic acid sequence 
to be expressed. Within a recombinant expression vector, "operably-linked" is intended to mean 
that the nucleotide sequence of interest is linked to the regulatory sequence(s) in a manner tot 
allows for expression of the nucleotide sequence (e.g> 9 in an in vitro transcription/translation 
system or m a host cell when the vector is introduced into the host cell). 

15 The tenn "regulatory sequence" is intended to includes promoters, enhancers and other 

expression control elements (e.g., polyadenylation signals). Such regulatory sequences are 
described, for example, in Goeddel, Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include 
those that direct constitutive expression of a nucleotide sequence in many types of host cell and 

20 those that direct expression of the nucleotide sequence only in certain host cells (e.g. , 

tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the 
design of the expression vector can depend on such factors as the choice of the host cell to be 
transformed, the level of expression of protein desired, etc. The expression vectors of the 
invention can be introduced into host cells to thereby produce proteins or peptides, including 

25 fusion proteins or peptides, encoded by nucleic acids as described herein (e.g. y MOLX proteins, 
mutant forms of MOLX proteins, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for expression of 
MOLX proteins in prokaryotic or eukaiyotic cells. For example, MOLX proteins can be 
expressed in bacterial cells such as Escheridiia era//, insect cells (using baculovims expression 

30 vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, 
GENE EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 1&5, Academic Press, San Diego, 
Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and translated 
in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 
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Expression of proteins in proxaryotcs h most often carried out in Escherichia coll with 
vectors containing constitutive or inducible promoters directing the expression of either fusion or 
non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded therein, 
usually to the amino terminus of the recombinant protein. Such fusion vectors typically serve 
5 three purposes: (i) to increase expression of recombinant protein: (if) to increase the solubility of 
the recombinant protein; and (Hi) to aid in the purification of the recombinant protein by acting 
as a Hgand in affinity purification. Often, in fiision expression vectors, a proteolytic cleavage 
site is introduced at the junction of the fusion moiety and the recombinant protein to enable 
separation of the recombinant protein from Lhe fusion moiety subsequent to purification of the 

10 fusion protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, 
thrombin and enterokinase. Typical fusion expression vectors include pGEX (Pharmacia 
Biotech Inc; Smith and Johnson, 1988. Gene 67: 31-40), pMAL (New England Biolabs, Beverly, 
Mass.) audpRTTS (Pharmacia, Piscafcaway, NJ.) that mse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to lhe larget recombinant protein* 

15 Examples of suitable inducible non-fusion E, coii expression vectors include pTrc 

(Amrann et aJ., (1988) Gene 69:301-3 1 5) and pET lid (Studier et al, GENE EXPRESSION 
TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990) 
60-89), 

One strategy to maximize recombinant protein expression m E coli is to express the 
20 protein in a host bacteria with an impaired capacity to proteolytically cleave the recombinant 

protein. See, e.g t> Gottesman, Genb EXPRESSION TECHNOLOGY: Methods IN ENZYMOLOGY 185, 
Academic Press, San Diego, Cali£ (1990) 119-128. Another strategy is to alter the nucleic acid 
sequence of the nucleic acid ro be inserted into an expression vector so that the individual codons 
for each amino acid are those preferentially utilized mE. coli (see, e.g., Wada, et uL, 1992. NucL 
25 Acids Res. 20: 2111-211 8). Such alteration of nucleic acid sequences of the invention can be 
carried out by standard DNA synthesis techniques. 

In another embodiment, the MOLX expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast Saccharomyces cerivtiae include pYepSecl 
(Baldari, etal, 1987. EMBOJ. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Ce//30: 
30 933-943), pJRY88 (Schulk et al, 1987. Gem 54: 1 13-123), pYES2 (luvitrogen Corporation, 
■ - " San Diego, Calif.), and picZ (InVitiogen Corp. Sari Diego, Calif.). 

Alternatively, MOLX can be expressed in insect cells using baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (eg. , 
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SF9 cells) include the pAc series (Smith, et ai, 1983. Mol. Cell Biol 3: 2156-21 65) and the 
pVL scries (I^jcklow and Summers, 1989. Virology 170: 31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in mammalian 
cells using a mammalian expression vecLor. Examples of mammalian expression vectors include 
5 pC13M8 (Seed, 1987. Nature 329: 840) and pMT2PC (Kaufman, et al, 1987. EMBO J. 6: 
187-195). When used in mammalian cells, the expression vector's control functions are often 
provided by viral regulatory elements. For example, commonly used promoters are derived from 
polyoma, adenovirus 2, cytomegalovirus, and simian virus 40. For other suitable expression 
systems for both prokaryotio and eukaryotic cells see, e.g., Chapters 16 and 17 of Safnbrook, et 

10 ah, Molecular Cloning: A Laboratory Manual. 2nd ed, Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. 

In another embodiment, the recombinant mammalian expression vectoT is capable of 
directing expression of the nucleic acid preferentially in a particular cell type (e.g. , 
tissue-specific regulatory elements are used to express [he nucleic acid). Tissue-specific 

1 5 regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific 
promoters include the albumin promoter (Kver-specific; Pinkert, et aL, 1987. Genes Dev. L: 
268-277), lymphoid-specific promoters (Calame and Eaton, 1988. Adv. Imnuinol 43: 235-275), 
inparticularpromotersofTcell receptors (Winoto and Baltimore, \9Z9.EMBOJ. 8:729-733) 
and immunoglobulins (Banerji, et al„ 1983. Cell 33: 729-740; Queen and Baltimore, 1983. Cell 

20 33: 741-748), neuron-specific promoters (e.g., the neurofilament promoter, Byrne and Ruddle, 
1989. Froc. Noil Acdd. ScL USA 86: 5473-5477), pancreasrspecific promoters (Edhmd, et ol. s 
1985. Science 230: 912-916), and mammary gland-specific promoters (e.g, milk whey promoter; 
U.S. Pat. No. 4,873,316 and European Application Publication No. 264,166). Developmentalry- 
regulatcd promoters are also encompassed, e.g. t the murine hox promoters (Kessel and Gruss, 

25 1990. Science 249: 374-379) and the a-fetoprotein promoter (Campes and Tilghman, 1989. 
Genes Dev. 3: 537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. That is, 
the DNA molecule is operath'ery-linied to a regulatory sequence in a manner that allows for 

30 expression (by transcription of the DNA molecule) of an RNA molecule that is antisense in 
MOLX mUNA. Regulatory sequences operativety linked to a nucleic acid cloned in the 
antisense orientation can be chosen that direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen that direct constitutive, tissue specific or cell type specific expression of 
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antisense SNA. The antisense expression vector can be in the form of a recombinant plasmid, 
phagemid or attenuated virus in which afltisenac nucleic acids arc produced under the control of 
a high efficiency regulatory region, the activity of which can be determined by the cell type into 
which the vector its introduced. For a discussion of the regulation of gene expression using 
5 antisense genes see, e.g. f Weintraub, et a!., "Antisense £N A as a molecular tool for genetic 
analysis," Reviews-Trends in Genetics, Vol 1(1) 1986. 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced The terms "host cell" and "recombinant 
host cell" atfe used interchangeably herein. It is understood that such terms refer not only to the 

10 particular subject cell but also to the progeny ox potential progeny of such a celL Because certain 
modifications may occur in succeeding generations due to either mutation or environmental 
influences, such progeny may not, in fact, bo identical to the parent cell, but are still included 
within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaiyotic celL For example, MOLX protein can 

15 be expressed in bacterial cells such as K coti, insect cells, yeast or mammalian cells (such as 
Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to those 
skilled in the art. 

Vector DNA can be introduced into pTokatyotic or eukaryctic cells via conventional 
transformation or transfection techniques. As used herein, the terms "kanatbnnation n and 

20 "transfection" are intended to refer to a variety of ait-recognized techniques for introducing 
foreign nucleic acid (e,g., DNA) into a host cell, including calcium phosphate or calcium 
chloride co-precipitation, DEAE-dcxtoan-incdiated transfection, lipofection, or electroporation. 
Suitable methods for transforming or tansfecting host cells can be found in Sairibrook, et ul. 
(Molecular Cloninc*; A Laboratory Manual. 2nded, Cold Spring Harbor Laboratory, 

25 Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory 
manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small traction of cells may integrate 
the foreign DNA into their genome. In order to identify and select these integrants, a gene that 
30 encodes a selectable marker (e.g. , resistance to antibiotics) is generally introduced into the host 
cells along with the gene of interest. Various selectable markers include those that confer 
resistance to drugs, such as 0418, hygromycin and methotrexate. Nucleic acid encoding a 
selectable marker can be introduced into a host cell on the same vector as that encoding MOLX 
or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic 
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acid can be identified by drug selection (e.g. , cells that have incorporated the selectable marker 
gene will survive, while Che other cells die). 

A host cell of the invention, such as a prokaryotic or oukaryotic host cell in culture, can 
he used to produce (i e. 9 express) MOLX protein. Accordingly, the invention further provides 
5 methods for producing MOLX protein using die host cells of the invention. In one embodiment, 
the method comprises culturing the host cell of invention (into which a recombinant expression 
vector encoding MOLX protein has been introduced) in a suitable medium such that MOLX 
protein is produced. In another embodiment, the method further comprises isolating MOLX 
protein from the medium or the host cell. 

10 Transgenic MOLX Animals 

The host cells of the invention can also he used In produce non-human transgenic 
animals. For example, in one embodiment a host cell of the invention is a fertilized oocyte or an 
embryonic stem cell into which MOLX protein-coding sequences have been introduced. Such 
host cells can then be used to create non-human transgenic animals in which exogenous MOLX 

15 sequences have been introduced into their genome or homologous recombinant animals in which 
endogenous MOLX sequences have been altered. Such animals are useful for studying the 
function and/or activity of MOLX protein and for identify ing and/or evaluating modulators of 
MOLX protein activity. As used herein, a h transgenic animal" is a non-hxunan animal, 
preferably a mammal, more preferably a rodent such as a rat or mouse, in which one or more of 

20 the cells of the animal includes a tomsgene. Other examples of transgenic animals include 
non-human primates, steep, dogs, cows, goats, chickens, amphibians, etc. A Lraosgene is 
exogenous DNA that is integrated into the genome of a cell from which a transgenic animal 
develops and that remains in the genome of the mature animal, thereby directing the expression 
of an encoded gene product in one or more cell types or tissues of the transgenic animal. As 

25 used herein, a "homologous recombinant animal" is a non-human animal, preferably a mammal, 
more preferably a mouse, in which an endogenous MOLX gene has been altered by homologous 
recombination between the endogenous gene and an exogenous DNA molecule introduced into a 
cell of the animal, e.g. , an embryonic cell of the animal, prior to development of the animal. 
A transgenic animal of the invention can be created by introducing MOLX-encoding 

30 nucleic acid into the male pronuclei of a fertilized oocyte (e.g. , by micromj ectinn, retroviral 
infection) and allowing the oocyte to develop in a pseudopregnant female fosleT animal. The 
human MOLX cDNA sequences ofSEQ ID NOS:l, 3, 5, 7, 9 ; 11, 13, 15, 17, 19, 21, 23, 25, 27, 
and 29 can be introduced as a Iransgene into the genome of anon-human animal. Alternatively, 
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a non-human homologuc of the human MOLX gene, such as a mouse MOLX gene, can be 
isolated based on hybridization to the human MOLX cDNA (described further supra) and used 
as a transgene. Intconic sequences and polyadenylatioii signals can also be included in the 
transgene Lo increase the efficiency of expression, of the transgene. A tissue-specific regulatory 
5 sequence(s) can be operably-Hnked to the MOLX transgene to direct expression of MOLX 

protein to particular cells. Methods for generating transgenic animals via embryo manipulation 
and microinjection, particularly animals such as mice, have become conventional in the art and 
are described, for example, in U.S. Patent Nos. 4,736,866; 4,870,009; and 4,873,191; and Hogan, 
1986. In: MANTPULATIXG THE MOUSE EMBRYO, Cold Spring Harbor Laboratory Press, Cold 

10 Spring Harbor, N.Y. Similar methods are used for production of other transgenic animals. A 
transgenic founder animal can be identified based upon the presence of the MOLX transgene in 
its genome and/or expression of MOLX rnRNA in tissues or cells of the animals. A transgenic 
founder animal can then be used to breed additional animals carrying the transgene. Moreover, 
' transgenic animals carrying a transgene-encoding MOLX protein can further be bred to other 

1 5 transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains at least 
a portion of an MOLX gene into which a deletion, addition or substitution has been introduced to 
thereby alter, eg., functionally disrupt, the MOLX gene. The MOLX gene can be a human gene 
(e the cDNA ofSEQ ID T*OS:l, 3, 5, 7, 9, 11,13, 15, 17, 19, 21, 23, 25, 27, and 29), but 

20 more preferably, is a non-human horaologue of a human MOLX gene. For example, a mouse 
homologue of human MOLX gene of SEQ ID NOS:l , 3, 5, 7, % I 1, 13, 15, 17, 19, 21, 23, 25, 
27, and 29 can be used to construct a homologous recombination vector suitable for altering an 
endogenous MOLX gene in the mnuse genome. In one embodiment, the vector is designed such 
that, upon homologous recombination, die endogenous MOLX gene is functionally disrupted 

25 (i.e., no longer encodes a functional protein; also referred to as a "knock out" vector). 

Alternatively, the vector can be designed such that, upon homologous recombination, the 
endogenous MOLX gene is mutated or otherwise altered but still encodes functional protein 
(e.g. , the upstream regulatory region can be altered to thereby alter the expression of the 
endogenous MOLX protein). In the homologous recombination vector, the altered portion of me 

30 MOLX gene is flanked at its 5 1 - and S'-termini by additional nucleic acid of the MOLX gene to 
allow for homologous recombination to occur between the exogenous MOLX gene carried by 
the vector and an endogenous MOLX gene in au embryonic stem cell. The additional flanking 
MOLX nucleic acid is of sufficient length for successful homologous recombination with the 
endogenous gene. Typically, several kilobases of flanking DNA (both at the 5 - and 3'-tennim) 
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are included in the vector. See, e.g., Thomas, et aL L987. Cell 5 1 : 503 for a description of 
homologous recombination vectors. Hie vector is ten introduced into an embryonic stem cell 
line (e.g., by electroporation) and cells in which the introduced MOLX gene has homologously- 
rceumbined wilh the endogenous MOLX gene are selected Sue, e.g. § Li, el ai. t 1992. Cell 69; 
5 915. 

Ttie selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to form 
aggregation chimeras, See t eg., Bradley, 1 987. In: TeratocarciNOMAS AMD EMBRYONIC STEM 
Cells: A Practical Approach, Robertson, ed. IRL, Oxford, pp. 1 13-152. A chimeric embryo 
can then be implanted into a suitable pseudopregnant female foster animal and the embryo 

1 0 brought to term. Progeny harboring the honiologousiy-recornbined DNA in their germ cells can 
be used to breed animals in which all cells of the animal contain the homologously-recombincd 
DNA by germline transmission of the transgene. Methods for constructing homologous 
recombination vectors and homologous recombinant animals are described further in Bradley, 
1991. Curr. Opin, Biotechnol 2: 823-829; PCT International Publication Nos.: WO 90/1 1354; 

1 5 WO 91/01 140; WO 92/0968; and WO 93/04169. 

In another embodiment, transgenic non-humans animals can be produced that contain 
selected systems that allow for regulated expression of the transgene. One example of such a 
system is the crc/ioxP recombinase system of bacteriophage PL For a description of the 
cre/loxP recombinase system, See, e.g t , Lakso, et a/.> 1992. Proc. NatL Acad Sci USA 89: 

20 6232-6236. Another example of a recombinase system is the FLP recombinase system of 

Saccharomyces cerevmae. See, O'Goruian, et al., 1 99L Science 25 1 : 135 1-1 355. If a cre/loxr 
recombinase system is used to regulate expression of the transgene, animals containing 
transgenes encoding both the Cre recombinase and a selected protein are required. Such animals 
can be provided through the construction of "double" transgenic animals, e.g., by mating two 

25 transgenic animals, one containing a transgene encoding a selected protein and the other 
containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut, et aL 7 1997. Nature 385: 810-813. In brief, a cell 
(eg, , a somatic cell) from the transgenic animal can be isolated and induced to exit the growth 

30 cycle and enter Go phase. The quiescent cell can then be fused, e.g., through the use of electrical 
pulses, to an enucleated oocyte from an animal of the same species from which the quiescent cell 
is isolated The reconstructed oocyte is then cultured such that it develops to morula or 
blastocyte and then transferred to pseudopregnant female foster animal. The offspring home of 
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this female foster animal will be a clone of the animal from which the cell (e.g., the somatic cell) 
is isolated. 

Pharmaceutical Compositions 

The MOLX nucleic acid molecules, MOLX proteins, and anti-MOLX antibodies (also 
5 referred to herein as "active compounds") of the invention, and derivatives, fragments, analogs 
and homologs thereof, can be incorporated into pharmaceutical compositions suitable foT 
administration. Such compositions typically comprise the nucleic acid molecule, protein, or 
antibody and a pharmaceutical^ acceptable carrier. As used herein, "phannaceutically 
acceptable carrier" is intended to include any and all solvents, dispersion media, coatings, 

10 antibacterial and antifungal agents, isotonic and absorption delaying agents, and the Wee, 
compatible with pharmaceutical administration. Suitable carriers ane described in the most 
recent edition of Remington's Pharmaceutical Sciences, a standard reference text in the field, 
which is incorporated herein by reference. Preferred examples of such carriers or diluents 
include, but are not limited to, water, saline, finger's solutions, dextrose solution, and 5% human 

1 5 serum albumin. Liposomes and non-aqueous vehicles such as fixed oils may also be used The 
use of such media and agents for pharmaceutical^ active substances is well known in the art 
Except insofar as any conventional media or agent is incompatible with the active compound, 
use thereof in the compositions is contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

20 A pharmaceutical composition of the invention is formulated to be compatible wilh its 

intended route of administration. Examples of routes of administration include parenteral, eg. , 
intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal (/.&, topical), 
transmucosal, and rectal administration. Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the following components: a sterile diluent 

25 such as water for injection, saKne solution, fixed oils, polyethylene glycols, glycerine, propylene 
glycol or other synthetic solvents: antibacterial agents such asben7yl alcohol or methyl 
parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such as 
ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates orphosphates, and 
agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be 

30 adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed m ampoules, disposable syringes or multiple dose vials made of 
glass or plastic. 
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Pharmaceutical compositions suitable foT injectable use include sterile aqueous solutions 
(where water soluble) or dispersions and sterile powders for the extemporaneous preparation of 
sterile injectable solutions or dispersion. For intravenous administration, suitable earners 
include physiological saline, bacteriostatic water, Cremophor EL™ (BASF, Parsippany, NJ.) or 
5 phosphate buffered saline (PBS). In all cases, the composition must be sterile and should be 
fluid to the extent that easy syringeability exists. It must be stable under Hie conditions of 
manufacture and storage and must be preserved against the contaminating action of 
microorganisms such as bacteria and fungi. The carrier can be a solvent or dispersion medium 
containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, and 

10 liquid polyethylene glycol, and the like), and suitable mixtures thereof The proper fluidity can 
be maintained, for example, by the use of a coating such as lecithin, by the maintenance of the 
required particle size in the case of dispersion and by the use of surfactants. Prevention of ihe 
action of microorganisms can be achieved by various antibacterial and antifungal agents, for 
example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many 

1 5 cases, it will be preferable to include isotonic agents, for example, sugars, polyalcohols such as 
manitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable 
compositions can be brought about by including in the composition an agent which delays 
absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound (e.g., 

20 an MOLX protein or anti-MOLX antibody) in die required amount in an appropriate solvent with 
one or a combination of ingredients enumerated above, as required, followed by tillered 
sterilization. Generally, dispersions are prepared by incorporating the active compound into a 
sterile vehicle that contains a basic dispersion medium and the required other ingredients from 
those enumerated above. In the case of sterile powders for the preparation of sterile injectable 

25 solutions, methods of preparation arc vacuum drying and freeze-drying that yields a powder of 
the active ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They can be 
enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic 
30 administration, die active compound can be incorporated with excipients and used m the form of 
tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier for use 
as a mouthwash, wherein the compound in the fluid carrier is applied orally and swished and 
expectorated or swallowed. Pbanuacentically compatible binding agents, and/or adjuvant 
materials can be included as part of the composition. The tablets, pills, capsules, troches and the 
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] ike can contain any of the fol lowing ingredients, or compounds of a similar nature: a binder 
such as miciocrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Prirnogel, or corn starch; a lubricant such as 
magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening agent 
5 such as sucrose or saccharin; or a flavoring agent such as peppermint, methyl salicylate, or 
orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an aerosol 
spray from pressured container or dispenser which contains a suitable propeDant, e.g., a gas such 
as carbon dioxide, or a nebulizer. 

10 Systemic administration can also be by transmucosal or transdermal means. For 

transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated 
are used in the formulation. Such penetrants are generally known in the art, and include, for 
example, for transmucosal administration, detergents, bile salts, and fusidic acid derivatives. 
Transmucosal administration can be accomplished through Lhe use of nasal sprays ur 

15 suppositories. For transdermal administration, the active compounds are formulated into 
ointments, salves, gels, or creams as generally known in the art 

The compounds can also be prepared in the form of suppositories (eg., with conventional 
suppository bases such as cocoa butter and other glycerides) or retention enemas for rectal 
delivery. 

20 hi one embodiment, the active compounds are prepared with carriers that will protect the 

compound against rapid elimination from the body, such as a controlled release formulation, 
including implants and microencapsulated delivery systems. Biodegradable, biocompatible 
polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, 
collagen, polyoithoesters, and polylactic acid. Methods for preparation of such formulations will 

25 be apparent to those skilled in the art The materials can also be obtained commercially from 
Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes 
targeted to infected cells with monoclonal antibodies to viral antigens) can al&o be used as 
pharmaceutical ly acceptable carriers. These can be prepared according to methods known to 
those skilled in the art, for example, as described in U.S. Patent No. 4,522,81 1. 

30 It is especially advantageous to formulate oral or parenteral compositions in dosage unit 

form for ease of administration and uniformity of dosage. Dosage unit form as used herein refers 
to physically discrete units suited as unitary dosages for the subject to be treated; each unit 
containing a predetermined quantity of active compound calculated to produce the desired 
therapeutic effect in association with the required pharmaceutical carrier. Ihe specification for 
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the dosage unit forms of the invention are dictated by and directly dependent on the unique 
characteristics of the active compound and the particular therapeutic effect to be achieved, and 
the limitations inherent in the art of compounding such an active compound for the treatment of 
individuals, 

5 The nucleic acid molecules of the invention can be inserted into vectors and used as gene 

theiapy vectors. Gene therapy vectors can be delivered to a subject by, for example, intravenous 
injection, local achiimistration (see, e.g., U.S. Patent No. 5,328,470) or by stereotactic injection 
(see, e.g> Chen, ei aU 1994. Proc. Natl. Acad Set USA 91: 3054-3057). The pharmaceutical 
preparation of the gene therapy vector can include the gene therapy vector in an acceptable 

1 0 diluent, or can. comprise a slow release matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector can be produced inlact from recombinant 
cells, e.g., retroviral vectors, the pharmaceutical preparation can include one or more cells that 
produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or dispenser 

1 5 together with instructions for a<hriinistration. 

Screening and Detection Methods 

The isolated nucleic acid molecules of the invention can be used to express MOLX 
protein (e.g., via a recombinant expression vector in a host cell in gene therapy applications), to 
detect MOLX mRNA [e.g. , in a biological sample) or a genetic lesion in an MOLX gene, and to 

20 modulate MOLX activity, as described further, below. In addition, the MOLX proteins can be 
used to screen dnigs or compounds that modulate the MOLX protein activity or expression as 
well as to treat disorders characterized by insufficient or excessive production of MOLX protein 
or production of MOLX protein forms that have decreased or aberrant activity compared to 
MOLX wild-type protein (e.g.' y diabetes (regulates insulin release); obesity (binds and transport 

25 lipids): metabolic disturbances associated with obesity, the metabolic syndrome X as well as 
anorexia and wasting disorders associated with chronic diseases and various cancers, and 
infectious disease(possesses anti-microbial activity) and the various dyslipidemias. In addition, 
the anti-MOLX antibodies of the invention can be used to detect and isolate MOLX proteins and 
modulate MOLX activity. In yet a further aspect, the invention can be used in methods to 

30 influence appetite, absorption of nutrients and die disposition of metabolic substrates in both a 
positive and negative fashion. 

The invention further pertains to novel agents identified by the screening assays 
described herein and uses thereof for treatments as described, supra. 
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Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") for 
identifying modulators, Le. 9 candidate or test compounds or agents (e.g., peptides, 
5 peptidomimetics, small molecules or other drugs) that bind to MOLX proteins or have a 

stimulatory or inhibitory effect on, eg., MOLX protein expression or MOLX protein activity. 
The invention also includes compounds identified in the screening assays described herein, 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of the membrane-bound form of an MOLX 

10 protein or polypeptide or biologically-active portion thereof. The test compounds of the 
invention can be obtained using any of the numerous approaches in combinatorial library 
methods known in the art, including: biological libraries; spatially addressable parallel solid 
phase or solution phase libraries; synthetic library methods reqiuring deconvolntion; the 
"one-bead one-compound'' library method; and synthetic library methods using affinity 

1 5 chromatograph}' selection. The biological library approach is limited to peptide Hbraries, while 
the other four approaches are applicable to peptide, non-peptide oligomer or small molecule 
libraries of compounds. See, e.g., Lam, 1997. Anticancer Drug Design 12: 145. 

A "small molecule" as used herein, is meant to refer to a composition that has a 
molecular weight of less than about 5 kD and most preferably less than about 4 kD. Small 

20 molecules can be, e.g.> nucleic acids, peptides, polypeptides, peptidennhnctics, carbohydrates, 
lipids or other organic or inorganic molecules, libraries of chemical and/or biological mixtures, 
such as fungal, bacterial, or algal extracts, are known in the art and can be screened with any of 
the assays of the invention. 

Examples of methods for the synthesis of molecular libraries can be found m the art, for 

25 example in: DeWilt, et at, 1993 . Proc. Natl. Acad. Set USA. 90: 6909; Erb, et al y 1994. Proc 
Natl Acad. Set USA. 91: 11422; Zuckermann, et aL, 1994. J. Med. Chem. 37: 2678; Cho, at a/., 
1993. Science 261: 1303; Carrel 1, et aL, 1994. Angew. Cheifulnt EdEngl. 33; 2059; Caiell,** 
al., 1994. Angew. Chem. Int. Ed. Engl 33: 2061; and Gallop, et of, 1994. /. Med. Chem. 37: 
1233. 

30 Libraries of compounds may be presented in solution (e.g. , Hougbten, 1 992. 

Biotechniques 13: 412-421), or on beads (Lam, 1991. Nature 354: 82-84), on chips (Fodor, 1993. 
Nature 364: 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), spores (Ladncr, U.S. Patent 
5,233,409), plasmids {Cull, et al., \ 992. Proc. Natl Acad. Set USA 89: 1 865-1869) or on phage 
(Scott and Smith, 1990. Science 249: 386-390; Devlin, 1990. Science 249: 404-406; Cwirla, et 
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al> 1990. Proc. Natl Acad. Set U.S^i. 87: 6378-6382; Fclici, 1991. J. Mol BioL 222: 301-310; 
Ladner, U.S. Patent No. 5,233,409.). 

hi one embodiment, an assay is a cell-based assay in which a cell which expresses a 
membrane-bound form of MOLX protein, or a biologically-active portion thereof, on the cell 
5 surface is contacted with a test compound and the ability of the test compound to bind to an 
MOLX protein determined- The cell, for example, can of mammalian origin or a yeast cell. 
Determining the ability of the test compound to bind to the MOLX protein can be accomplished, 
for example, by coupling the test compound with a radioisotope or enzymatic label such that 
binding of the test compound to the MOLX protein or biologically-active portion thereof can be 

10 determined by detecting the labeled compound in a complex. For example, test compounds can 
be labeled with 125 L 33 S, t4 C, or ^either directly or indirectly, and the radioisotope detected by 
direct counting of radioemission or by scintillation counting. Alternatively, test compounds can 
be enzymatically-labeled with, for example, horseradish peroxidase, alkaline phosphatase, or 
luciferase, and the enzymatic label detected by determination of conversion of an appropriate 

15 substrate to product In one embodiment, the assay comprises contacting a cell which expresses 
a membrane-bound form of MOLX protein, or a biologically-active portion thereof, on the cell 
surface with a known compound which binds MOLX to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the ability of the test compound to interact 
with an MOLX protein, wherein detemmiing the ability of the test compound to interact with an 

20 MOLX protein comprises detemiining the ability of the test compound to preferentially bind to 
MOLX protein or a biologically-active portion thereof as compared to the known compound. 

In another embodiment, an assay is a cell-based assay comprising contacting a cell 
expressing a membrane-bound form of MOLX protein, or a biologically-active portion thereof . 
on the cell surface with a test compound and detemiining the ability of the test compound to 

25 modulate (e.g., stimulate or inhibit) the activity of the MOLX protein or biologically-active 

portion thereof. Detemiining the ability of the test compound to modulate the activity of MOLX 
or a biologically-active portion thereof can be accomplished, for example, by determining the 
ability of the MOLX protein to bind to or interact with an MOLX target molecule. As used 
herein, a "target molecule" is a molecule with which an MOLX protein binds or interacts m 

30 nature, for example, a molecule on the surface of a cell which expresses an MOLX interacting 
protein, a molecule on the surface of a second cell, a molecule in the extracellular milieu, a 
molecule associated with the internal surface of a cell membrane or a cytoplasmic molecule. An 
MOLX target molecule can be a non-MOLX molecule or an MOLX protein or polypeptide of 
the invention. In one embodiment, an MOLX target molecule is a component of a signal 

121 



WO 01/81578 



PCTATS01/1 3578 



transduction pathway that facilitates transduction of an extracellular signal (e.g. a signal 
generated by binding of a compound to a membrane-bound MOLX molecule) through the cell 
membrane and into the cell. The target, for example, can be a second intercellular protein that 
has catalytic activity or a protein that facilitates the association of downstream signaling 
5 molecules with MOLX, 

Determining the ability of the MOLX protein to bind to or interact with an MOLX target 
molecule can bo accomplished by one of the methods described above for detennining direct 
binding. In one embodiment, determining the ability of the MOLX protein to bind to or interact 
with an MOLX target molecule can be accomplished by detennining the activity of the target 

10 molecule. For example, the activity of the target molecule can be determined by detecting 

induction of a cellular second messenger of the target (i.e. intracellular Ca 2 *, diacylglycerol, IP3, 
etc.), detecting catalytic/enzymatic activity of the target an appropriate substrate, detecting the 
induction of a reporter gene (comprising an MOLX-responsive regulatory element operatively 
linked to a nucleic acid encoding a detectable marker, e.g., luciferase), or detecting a cellular 

1 5 response, for example, cell survival, cellular differentiation, or cell proliferation. 

In yet another embodiment, an assay of the invention is a cell-free assay comprising 
contacting an MOLX protein or biologically-active portion thereof with a test compound and 
determining the ability of the test compound to bind to the MOLX protein or biologically-active 
portion thereof. Binding of the test compound to the MOLX protein can be determined either 

20 directly or indirectly as described above. In one such embodiment, the assay comprises 

contacting the MOLX protein or biologically-active portion thereof with a known compound 
which binds MOLX to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact with an MOLX protein, 
wherein determining the ability of the test compound to interact with an MOLX protein 

25 comprises determining flic ability of die test compound to preferentially bind to MOLX or 
biologically-active portion thereof as compared to the known compound. 

In still another embodiment, an assay is a cell-ftee assay comprising contacting MOLX 
protein or biologically-active portion thereof with a test compound and determining the ability of 
the test compound to modulate (e.g. stimulate or inhibit) the activity of the MOLX protein or 

30 biologically-active portion thereof. Determining the ability of the test compound to modulate the 
activity of MOLX can be accomplished, for example, by determining the ability of the MOLX 
protein to bind to an MOLX target molecule by one of the methods described above for 
determining direct binding. In an alternative embodiment, detennining the ability of the tesl 
compound to modulate the activity of MOLX pTotein can be accomplished by detennining the 
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ability of the MOLX protein further modulate an MOLX target molecule. For example, tlic 
catalytic/enTyrnatic activity of the target molecule on an appropriate substrate can be determined 
as described, supra. 

In yet another embodiment, the cell-free assay comprises contacting the MOLX protein 
5 or biologicalry-actrve portion thereof with a known compound which binds MOLX protein to 
form an assay mixture, contacting the assay mixture with a test compound, and determining the 
ability of the test compound to interact with an MOLX protein, wherein determining the ability 
of the test compound to interact with an MOLX protein comprises deternrirring the ability of the 
MOLX protein to preferentially bind to or modulate the activity of an MOLX target molecule. 

10 The cell-free assays of the invention are amenable to use of both the soluble form or the 

membrane-bound form of MOLX protein. In the case of cell-free assays comprising the 
membrane-bound form of MOLX protein, it may be desirable to utilize a solubilisring agent such 
that the membrane-bound form of MOI.X protein is maintained in solution. Examples of such 
solubilizing agents include non-ionic detergents such as n-octylglucoside, n-dodecylglucoside, 

1 5 n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-Nnanefliylglucamide, Triton® 
X-100, Triton® X-1 14, Thesit®, Isotriarcypor>(ethylene glycol ether) n , N-dodecyl- 
N^-dimethyl-3-annTiomo-l-propanc sulfonate, 3-{3-cholamidopropyl) dirnethylarnminiol- 
Irpropanc sulfonate (CHAPS), or 3-(3K;holamidopropyl)drmethylamminiol-2-hydroxy- 
1 -propane sulfonate (CHAPSO). 

20 In more than one embodiment of the above assay methods of the invention* it may be 

desirable to immobilize either MOLX protein or its target molecule to facilitate separation of 

• ■ . ..■ \ ' ■ . 

complexed from uncomplexed forms of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to MOLX protein* or interaction of MOLX 
protein with a target molecule in the presence and absence of a candidate comppund, can be 

25 accomplished in any vessel suitable for containing the reaclanb. Examples of such vessels 
include microtiter plates, test tubes, and micro-centrifuge tubes, hi one embodiment, a fusion 
protein can be provided that adds a domain that allows one or both of the proteins to rpe bound to 
a matrix. For example, GST-MOLX fusion proteins or GST-target fusion proteins can be 
adsorbed onto glutatliione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 

30 derivatized microtiter plates, that are then combined with the test compound or the test 

compound and either the non-adsorbed target protein or MOLX protein, and the mixture is 
incubated under conditions conducive to complex formation (eg., at physiological conditions for 
salt and pH). Following incubation, the beads or microtiter plate wells ore washed to remove 
any unbound components, the matrix immobilized in the case of beads, complex determined 
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either directly or indirectly, for example, as described, supra. Alternatively, the complexes can 
be dissociated from the malm* and the level of MOLX protein binding or activity determined 
using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the screening 
5 assays of the invention. For example, either the MOLX protein or its target molecule can be 
immobilized utilizing conjugation of biotin ami streplavidin. Biotinylated MOLX protein or 
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using techniques 
well-known within the art (e.& , biotinylation kit, Pierce Chemicals, Rockford, TIL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, 

1 0 antibodies reac tive with MOLX protein or target molecules, but which do not interfere with 
binding of {he MOLX protein to its target molecule, can be derivatized to the wells of the plate, 
and xinbound target or MOLX protein trapped in the wells by antibody conjugation. Methods for 
detecting such complexes, in addition to those described above for the GST-immobilized 
complexes, include immunodetection of complexes using antibodies reactive with the MOLX 

15 . protein or target molecule, as well as enzyme-linked assays that rely on detecting an enzymatic 
activity associated with the MOLX protein or target molecule. 

La another embodiment modulators of MOLX protein expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of MOLX 
mRNA or protein in the cell is determined. Ihe level of expression of MOLX mRNA or protein 

20 in the presence of the candidate compound is compared to the level of expression of MOLX 

mRNA or protein in the absence of the candidate compound. The candidate compound can then 
be identified as a modulator of MOLX mRNA or protein expression based upon this comparison. 
For example, when expression of MOLX mRNA or protein is greater (/,&, statistically 
significantly greater) in the presence of the candidate compound than in its absence, the 

25 candidate compound is identified a stimulator of MOLX mRNA or protein expression. 

Alternatively, when expression of MOLX mRNA or protein is less (statistically significantly 
less) in the presence of the candidate compound than in its absence, the candidate compound is 
identified as an inhibitor of MOLX mRNA or protein expression. The level of MOLX mRNA or 
protein expression in the cells can be determined by methods described herein lor detecting 

30 MOLX mRNA or protein. 

In yet another aspect of the invention, the MOLX proteins can be used as "bait proteins" 
in a two-hybrid assay or three hybrid assay (see, e.g. 9 U.S. Patent No. 5,283,3 17; Zervos, et al., 
1993. Celt 72: 223-232; Madura, et al. % 1993. J. Biol C/iem. 268: 12046-12054; Bartel, et al % 
IWZ.Bioiechmques 14: 920-924; Iwabuchi, etaU 1993. Oncogene 8: 1693-1696; and Brent 



124 



WO 01/81578 



PCT/US01/13578 



WO 94/10300), to identify other proteins that bind to or interact with MOLX ("MOLX-bmding 
proteins" or "MOLX-bp") and modulate MOLX activity. Such MOLX-binding proteins are also 
likaly to be involved in the propagation of signals by the MOLX proteins as, for example, 
upstream or downstream elements of the MOLX pathway. 
5 The two-hybrid system is based on the modular nature of most transcription factors, 

which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes two 
different DNA constructs. In one construct, the gene that codes for MOLX is fused to a gene 
encoding the DNA binding domain of a known transcription factor (e.g. t GAL-4). In the other 
construct, a DNA sequence, from a library of DNA sequences, that encodes an unidentified 

10 protein ("prey 11 or "sample") is fused to a gene that codes for the activation domain of die known 
transcription factor. If the ''bait" and the "prey' 1 proteins arc able to interact, in vivo, forming an 
MOLX-dependent complex, the DNA-binding and activation domains of the transcription factor 
are brought into close proximity. This proximity allows transcription of a reporter gene (e.g. . 
LacZ) that is operably linked to a transcriptional regulatory site responsive to the transcription 

IS factor. Expression of the reporter gene can be detected and cell colonics containing the 

functional transcription factor can be isolated and used to obtain the cloned gene that encodes the 
protein which interacts with MOLX, 

The invention further pertains to novel agents identified by the aforementioned screening 
assays and uses thereof for treatments as described herein. 

20 Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the corresponding 
complete gene sequences) can be used in numerous ways as polynucleotide reagents. By way of 
example, aid not of limitation, these sequences can be used to: (i) map their respective genes on 
a chromosome; and, thus, locate gene regions associated with genetic disease; (//) identify an 
25 individual from a minute biological sample (tissue typing); and (in) aid in forensic identification 
of a biological sample. Some of these applications are described in the subsections, below. 

Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
30 sequence can be used to map the location of the gene on a chromosome. This process is called 
chromosome mapping. Accordingly, portions or fragments of the MOLX sequences, SEQ ID 
NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or fragments or derivatives thereof, 
can be used to map the location of the MOLX genes, respectively, on a chromosome. The 
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mapping of the MOLX sequences to chromosomes is an important first step in correlating these 
sequences with genes associated with disease. 

Briefly, MOLX genes can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp in length) from the MOLX sequences. Computer analysis of the MOLX, 
5 sequences can he used to rapidly select primers that do not span more than one exon in the 

genomic DNA, thus complicating the amplification process. These primers can then be used for 
PCR screening of somatic cell hybrids containing individual human chromosomes. Only those 
hybrids containing the human gene corresponding to the MOLX sequences will yield an 
amplified fragment. 

10 Somatic cell hybrids are prepared by fusing somatic cells from different mammals , 

human and mouse cells). As hybrids of human and mouse cells grow and divide, they gradually 
lose human chromosomes in random order, hut retain the mouse chromosomes. By using media 
in which mouse cells canno t grow, because they lack a particular enzyme, but in which human 
cells can, the one human chromosome that contains the gene encoding the needed enzyme will 

15 be retained. By using various media, panels of hybrid cell lines can be established. Each cell 
Hne in a panel contains cither a single human chromosome or a small number of human 
chromosomes, and a full set of mouse chromosomes, allowing easy mapping of individual genes 
to specific human chromosomes. See, e.g. t D'Eustachio, et al 9 1983. Science 220: 919-924. 
Somatic cell hybrids containing only fragments of human chromosomes can also be produced by 

20 using human chromosomes with translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a particular 
sequence to a particular chromosome. Three or more sequences can be assigned per day using a 
single thermal cycler. Using the MOLX sequences to design oligonucleotide primers, sub- 
localization can be achieved with panels of fragments from specific chromosomes. 

25 Fluorescence in situ hybridization (FISH) of a DNA sequence to a mctaphase 

chromosomal spread can further be used to provide a precise chromosomal location in one step. 
Chromosome spreads can be made using cells whose division has been blocked m metaphase by 
a chemical like colcemid that disrupts the mitotic spindle. The chromosomes can he treated 
briefly with trypsin, and then stained with Giemsa. A pattern of light and dark bands develops 

30 on each chromosome, so that the chromosomes can be identified individually. The FISH 
technique can be used with a DNA sequence as short as 500 or 600 bases. However, clones 
larger than 1 ,000 bases have a higher likelihood of binding to a unique chromosomal location 
with sufficient signal intensity for simple detection, Preferably 1 ,000 bases, and more preferably 
2»000 bases, will suffice to get good results at a reasonable amount of time. For a review of this 
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technique, see, Verma, ei al, Human Chromosomes: A Manual of Basic Techniques 
(Pergarnon Press, New York 1988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for marking 
5 multiple sites and/or multiple chromosomes. Reagents corresponding to noncoding regions of 
the genes actually are preferred for mapping purposes. Coding sequences are more likely to be 
conserved within gene families, thus increasing the chance of cross hybridizations during 
chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the physical 

10 position of the sequence on the chromosome can be correlated with genetic map data. Such data 
are foirad, e.g., in McKusick, MBNDEUAN INHERITANCE IN MAN, available on-line through Johns 
Hopkins University Welch Medical Library) . The rel ationship between genes and disease, 
mapped to the same chromosomal region, can then he identified through linkage analysis 
(co-inheritance of physically adjacent genes), described in, e.g., Egeland, ei al., 1987. Nature* 

15 325; 783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the MOLX gene, can be determined. If a mutation is 
observed in some or all of the affected individuals but not hi any unaffected individuals, then the 
mutation is likely to be the causative agent of the particular disease. Comparison of affected and 

20 unaffected individuals generally involves first looking for structural alterations in the 

chromosomes, such as deletions or translocations that are visible from chromosome spreads or 
detectable using PGR based on that DNA sequence. Ultimately, complete sequencing of gpnes 
from several individuals can be performed to confirm the presence of a mutation and to 
distinguish mutations from polymorphisms. 

25 

Tissue Typing 

The MOLX sequences of the invention can also be used to identify individuals from 
minute biological samples. In this technique, an individual's genomic DNA is digested with one 
or more restriction enzymes, and probed on a Southern blot to yield unique bands for 
30 identification. The sequences of the invention are useful as additional DNA markers for RFLP 
("restriction fragment length rttlymojqahisais/' described in U.S. Patent No. 5,272,057). 

Furthermore, the sequences of the invention can be used to provide an alternative 
technique that determines the actual basc-by-basc DNA sequence of selected portions of an 
individual's genome. Thus, the MOLX sequences described herein can be used to prepare two 
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PCR primers from the 5'- and 3-termini of the sequences. These primers can then he used to 
amplify an individual's DNA and subsequently sequence it 

Panels of corresponding DNA sequences from individuals, prepared in this manner, can 
provide unique individual identifications, as each individual will have a unique set of such DNA 
5 sequences due to allelic differences. The sequences of the invention can be used to obtain such 
identification sequences from individuals and from tissue. The MOLX sequences of the 
invention uniquely represent portions of the human genome. Allelic variation occurs to some 
degree in the coding regions of these sequences, and to a greater degree in the noncoding 
regions. It is estimated that allelic variation between individual humans occurs with a frequency 

10 of about once per each 500 bases. Much of the allelic variation is due to single nucleotide 
polymorphisms (SNPs), which include restriction fragment length polymorphisms (RFLPs). 

Each of the sequences described herein can, to some degree, be used as a standard against 
which DMA from an individual can be compared for identification purposes. Because greater 
numbers of polymorphisms occur in the noncoding regions, fewer sequences are necessary to 

15 differentiate individuals. The noncoding sequences can comfortably provide positive individual 
identification with a panel of perhaps 10 to 1,000 primers that each yield a noncoding amplified 
sequence of 100 bases. If predicted coding sequences, such as those in SEQ IDNOS:l, 3, 5, 7, 
9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29 are used, a more appropriate number of primers for 
positive individual identification would be 500-2,000. 

. 20 

Predictive Medicine 

The invention also pertains to the field of predictive medicine in which diagnostic assays, 
prognostic assays, phannacogenomics, and monitoring clinical trials aTe used for prognostic 
(predictive) purposes to thereby treat an individual prophylactically. Accordingly, one aspect of 

25 the invention relates to diagnostic assays for determining MOLX protein and/or nucleic acid 
expression as well as MOLX activity, in die context of a biological sample (e.£., blood, serum, 
cells, tissue) to thereby determine whether an individual is afflicted wife a disease or disorder, or 
is at risk of developing a disorder, associated with aberrant MOLX expression or activity. The 
disorders include metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer- 

30 associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's 
Disorder, immune disorders, and hematopoietic disorders, and the various dyslipidemias, 
metabolic disturbances associated with obesity, the metabolic syndrome X and wasting 
disorders associated with chronic diseases and various cancers. The invention also provides for 
prognostic (or predictive) assays for determining whether an individual is at risk of developing a 
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disorder associated with MOLX protein, nucleic acid expression or activity. For example, 
mutations in an MOLX gene can be assayed in a biological sample. Such assays can be used for 
prognostic or predictive purpose to thereby prophylactically treat an individual prior to the onset 
of a disorder characterized by or associated with MOLX protein, nucleic acid expression, or 
biological activity. 

Another aspect of the invention provides methods for determining MOLX protein, 
nucleic acid expression or activity in an individual to thereby select appropriate therapeutic or 
prophylactic agents for that individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows for the selection of agents (e.g., drugs) for therapeutic or prophylactic 
treatment of an individual based on the genotype of the individual (e.g., the genotype of ihe 
individual examined to determine the ability of the individual to respond to a particular agent.) 

Yet another aspect of the invention pertains to monitoring die influence of agents (e.g. , 
drug*, compounds) on the expression or activity of MOLX in clinical trials. 

These and other agents are described in further detail in the following sections. 

Diagnostic Assays 

An exemplary method for detecting the presence or absence of MOLX in a biological 
sample involves obtaining a biological sample from a test subject and contacting ihe biological 
sample with a compound or an agent capable of detecting MOLX protein or nucleic acid (e.g. , 
mRNA, genomic DNA) that encodes MOLX protein such that the presence of MOLX is detected 
in the biological sample. An agent for detecting MOLX mRJNA or genomic DNA is a labeled 
nucleic acid probe capable of hybridizing to MOLX mRNA or genomic DNA. The nucleic acid 
probe can be, for example, a full-length MOLX nucleic acid, such as the nucleic acid of SEQ ID 
NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or a portion thereof such as an 
oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to 
specifically hybridize under stringent conditions to MOLX mKNA or genomic DNA. Other 
suitable probes for use in the diagnostic assays of the invention are described herein. 

An agent for detecting MOLX protein is an antibody capable of binding to MOLX 
protein, preferably an antibody with a detectable labeL Antibodies can be polyclonal* or more 
preferably, monoclonal. An intact antibody, or a fragment thereof (e.g.. Fab or F(ab*)2) can be 
used. The term "labeled", with regard to die probe or antibody, is intended to encompass direct 
labeling of the probe or antibody by coupling physically linking) a detectable substance to 
the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity with 
another reagent that is directly labeled. Examples of indirect labeling include detection of a 
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primary antibody using a fluorescently-labeled secondary antibody and end-labeling of a DNA 
probe with biotm such thai it can be detected with fluorescently-labeled streptavidin. The term 
"biological sample" is intended to include tissues, cells and biological fluids isolated from a 
subject, as well as tissues, cells and fluids present within a subject That is, the detection method 
5 of the invention can be used to detect MOLX mRNA, protein, or genomic DNA in a biotogical 
sample in vitro as well as in vivo. For example, in vitro techniques for detection of MOLX 
mRNA include Northern hybridizations and in situ hybridizations. In \ntro techniques Tot 
detection of MOLX protein include enzyme linked immunosorbent assays (ELlSAs), Western 
blots, immunoprecipitations, and immunofluorescence. In vitro techniques for detection of 

10 MOLX genomic DNA include Southern hybridizations. Furthermore, in vivo techniques for 
detection of MOLX protein include introducing into a subject a labeled anti-MOLX antibody. 
For example, the antibody can be labeled with a radioactive marker whose presence and location 
in a subject can be detected by standard imaging techniques. 

in one embodiment, the biological sample contains protein molecules from the test 

1 5 subject. Alternatively, the biological sample can contain mRNA molecules from the test subject 
or genomic DNA molecules from the test subject. A preferred biological sample is a peripheral 
blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control biological 
sample from a control sabjeel, contacting the control sample with a compound or agent capable 

20 of detecting MOLX protein, mRNA, or genomic DNA, such that the presence of MOLX protein, 
mRNA or genomic DNA is detected in the biological sample, and comparing the presence of 
MOLX protein, mRNA or genomic DNA in the control sample with the presence of MOLX 
protein, mRNA or genomic DNA in the teat sample. 

The invention also encompasses kits fox detecting the presence of MOLX m a biological 

25 sample. For example, the kit can comprise: a labeled compound or agent capable of detecting 

MOLX protein or mRNA in a biological sample; means for determining the amount of MOLX in 
the sample; and means for comparing the amount of MOLX in the sample with a standard. The 
compound or agent can be packaged in a suitable container. The kit can further comprise 
instructions for using the kit to detect MOLX pro lein or nucleic acid. 

30 

Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify subjects 
having or at risk of developing a disease or disorder associated with aberrant MOLX expression 
or activity. For example, the assays described herein, such as the preceding diagnostic assays or 
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the following assays, can be utilized to identify a subject having or at risk of developing a 
disorder associated with MOLX protein, nucleic acid expression or activity. Alternatively, the 
prognostic assays can be utilized to identify a subject having or at risk for developing a disease 
or disorder. Thus, the invention provides a method for identifying a disease or disorder 
5 associated with aberrant MOLX expression or activity in which a test sample is obtained from a 
subject and MOLX protein or nucleic acid (e.g., tnRNA, genomic DNA) is detected, wherein tha 
presence of MOLX protein or nucleic acid is diagnostic for a subject having or al risk of 
developing a disease or disorder associated with aberrant MOLX expression or activity. As used 
herein, a "test sample" refers to a biological sample obtained from a subject of interest For 

10 example, a test sample can be a biological fluid (e.g., sermn), cell sample, or tissue. 

FurtTiennome, the prognostic assays described herein can be used to determine whether a 
subject can be administered an agent (e.g. , an agonist, antagonist, peptidomimetk, protein, 
peptide, nucleic acid, small molecule, or other drug candidate) to treat a disease or disorder 
associated with aberrant MOLX expression or activity. For example, such methods can be used 

15 to determine whether a subject can be effectively treated with an agent for a disorder. Thus, the 
invention provides methods for determining whether a subject can be effectively treated with an 
agent for a disorder associated with aberrant MOLX expression or activity in which a test sample 
is obtained and MOLX protein or nucleic acid is detected (e*g. , wherein the presence of MOLX 
protein or nucleic acid is diagnostic for a subject that can be administered the agent to treat a 

20 disorder associated with aberrant MOLX expression or activity). 

The methods of the invention can also be used to detect genetic lesions in an MOLX 
gene, thereby determining if a subject with the lesioned gene is at risk for a disorder 
characterized by aberrant cell proliferation and/or differentiation. In various embodiments, the 
methods include detecting, in a sample of cells from the subject, the presence or absence of a 

25 genetic lesion characterized by at least one of an alteration affecting the integrity of a gene 
encoding an MOLX-protein, or the miscx^ression of the MOLX gene. For example, such, 
genetic lesions can be detected by ascertaining the existence of at least one of: (i) a deletion of 
one or more nucleotides from an MOLX gene; (ii) an addition of one or more nucleotides to an 
MOLX gene ; (lit) a substitution of one or more nucleotides of an MOLX gene, (fv) a 

30 chromosomal rearrangement of an MOLX gene; (v) an alteration in the level of a messenger 
RNA transcript of an MOLX gene, (vx) aberrant modification of an MOLX gene, such as of die 
methylation pattern of the genomic DNA, iyii) the presence of a non- wild-type splicing pattern 
of a messenger RNA transcript of an MOLX gene, (raw) a non- wild-type level of an MOLX 
protein, (ix) allelic loss of an MOLX gene, and (x) inappropriate post-translatioaal modification 
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of an MOLX protein. As described herein, there are a large number or assay techniques Toiown 
in the art which can be used for detecting lesions in an MOLX gene. A preferred biological 
sample is a peripheral blood leukocyte sample isolated by conventional means fiom a subject. 
However, any biological sample containing nucleated cells may be used, including, for example, 
5 buccal mucosal cells. 

In certain embodiments, detection of the lesion involves the use of a probe/primer in a 
polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 4,683,202), such as 
anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction (LCR) (see, e.g. 9 
Landegran, et at., 1988. Science 241: 10774080; and Nakazawa, et al., 1994. Proc. Natl Acad. 

10 ScL USA 91 : 360-364), the latter of which can be particularly usefiil for detecting point 

mutations in the MOLX-gene (see, Abtavaya, ei al^ 1995. Nud. Acids Res. 23: 675-682). This 
method can include the steps of collecting a sample of cells from a patient, isolating nucleic acid 
(e.g., genomic, mRNA or both) from the cefls of the sample, contacting the nucleic acid sample 
with one or more primers that specifically hybridize to an MOLX gene under conditions such 

1 5 that hybridization and amplification of the MOLX gene (if present) occurs, and detecting the 
presence or absence of an amplification product, or detecting the size of the amplification 
product and comparing die length to a control sample. It is anticipated that PCR and/or LCR 
may be desirable to use as a preliminary amplification step in conjunction with any of the 
techniques used for detecting mutations described herein. 

20 Alternative amplification methods include: self sustained sequence replication (see, 

Guatelli, etai, 1990. Proc. Natl Acad. Sci. USA 87: 1874-1878), transcriptional amplification 
system (see, Kwoh, et ai 9 1989. Proc Natl Acad. Sci. USA 86: 1 173-1 177); QP Replicase (see, 
Lizardi, ei ah 1988. BioTechnology 6: 1 197). or any other nucleic acid amplification method, 
followed by the detection of the amplified molecules using techniques well known to those of 

25 skill in die art These detection schemes are especially usefiil for the detection of nucleic acid 
molecules if such molecules are present in very low numbers. 

In an alternative embodiment, mutations in an MOLX gene from a sample cell can be 
identified by alterations in restriction enzyme cleavage patterns. For example, sample and 
control DNA is isolated, amplified (optionally), digested with one or more restriction 

30 endoniicleases, and fragment length sizes are determined by gel electrophoresis and compared. 
Differences in fragment length sizes between sample and control DNA indicates mutations in the 
sample DNA. Moreover, the use of sequence specific ribozymes (see, e.g, US. Patent No, 
5,493,531) can be used to score far the presence of specific mutations by development or loss of 
a ribo2yme cleavage site. 
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In other embodiments, genetic mutations in MOLX can be identified by hybridizing a 
sample and control nucleic acids, e.g., DNA or RNA, to high-density arrays containing hundreds 
or thousands of oligonucleotides probes. See, e.g, Cronin, et al, 1996. Human Mutation 7: 
244-255; Kozai, et at., 1996, Nat. Med 2: 753-759. For example, genetic mutations in MOLX 
5 can be identified in tvvo dimensional arrays containing light-generated DNA probes as described 
in Cronin, et al, supra. Briefly, a first hybridization array of probes can be used to scan through 
long stretches of DNA in a sample and control to identify base changes between the sequences 
by making linear arrays of sequential overlapping probes, Uris step allows the identification of 
point mutations. This is followed by a second hybridization aiiay that allows the 

1 0 characterization of specific mutations by using smaller, specialized probe arrays complementary 
to all variants or mutations detected. Each mutation array is composed of parallel probe sets, one 
complementary to the wild-type gene and the other complementary to the mutant gene. 

In yet another embodiment any of a variety of sequencing reactions known in the art can 
be used to directly sequence the MOLX gene and detect mutations by comparing the sequence of 

1 5 the sample MOLX with the corresponding wild-type (control) sequence. Examples of 

sequencing reactions include those based on techniques developed by Maxim and Gilbert, 1977. 
Proc. Natl Acad. Set USA 74: 560 or Sanger, 1977. Proc. Natl Acad. Set USA 74: 5463. It is 
also contemplated that any of a variety of automated sequencing procedures can be utilized when 
performing Ihe diagnostic assays (see, e.g., Naeve, et al. y 1995. Biotechniques 19: 448), 

20 including sequencing by mass spectrometry (see, e.g. , PCX International Publication No. WO 
94/16101; Cohen, et aL, 1996. Adv. Chromatography 36: 127-162; and Griffin, et al., 1993. 
Appl Biochem. Biotechnol. 38: 147-159). 

Other methods for detecting mutations in the MOLX gene include methods in which 
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or KNA/DN A 

25 heteroduplexes. See, e.g., Myers, et al., 1985. Science 230: 1242. In general, die art technique 
of "mismatch cleavage" starts by providing heteroduplexes of formed by hybridizing (labeled) 
RNA or DNA containing the wild-type MOLX sequence with potentially mutant RNA or DNA 
obtained from a tissue sample. The double-stranded duplexes are treated wife an agent that 
cleaves single-stranded regions of Ihe duplex such as which will exist due to basepair 

30 mismatches between the control and sample strands. Fox instance, RNA/DNA duplexes can be 
treated with RNase and DNA/DNA hybrids treated with S j nuclease to enzymaticalfy digesting 
the mismatched regions. In other embodiments, either DNA/DNA or RNA/DNA duplexes can 
be treated with hydroxylamine or osmium tetroxide and with piperidine in order to digest 
mismatched regions. After digestion of the mismatched regions, the resulting material is then 
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separated by size on denaturing polyacrytamide gels to determine the site of mutation. See, e.g., 
Cotton, etal, 1988. Proc. Natl Acad. Sci. USA 85: 4397; Saleeba, etal,, 1992. Methods 
Euzymol 217: 285-295. In an embodiment, the control DNA or RMA can be labeled for 
detectioa 

5 In still another embodiment, the mismatch cleavage reaction employs one or more 

proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 
mismatch repair 8 enzymes) in defined systems for detecting and mapping point mutations in 
MOLX cDNAb obtained from samples of cells. For example, the mutY enzyme of E. coli 
cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T at 

10 G/T mismatches. See, e.g., Hsu, etal, 1994, Carcinogenesis 15: 1657-1662. According to an 
exemplary embodiment, a probe based on an MOLX sequence, e.g., a wild-type MOLX 
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex is 
treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be detected 
from electrophoresis protocols or the like. See, e.g„ U.S. Patent No. 5,459,039. 

1 5 In other embodiments, alterations in clcctrophorctic mobility will be used to identify 

mutations in MOLX genes. For example, single strand conformation polymorphism (SSCP) 
may be used to detect differences in electrophoretic mobility t>etween mutant and wild type 
nucleic acids. See, e.g. p Orita, et cd. 9 1989. Proc. NatL Acad. ScL USA: 86: 2766; Cotton> 1993. 
Mutal Res. 285: 125-144; Hayashi, 1992. Genet. Anal. Teck Appl. 9: 73-79. Single-stranded 

20 DNA fragments of sample and control MOLX nucleic acids will be denatured and allowed to 
renature. The secondary structure of single-stranded nucleic acids varies according to sequence, 
the resulting alteration in electrophoretic mobility enables the detection of even a single base 
change. Trie DNA fragments may be labeled or detected with labeled probes. The sensitivity of 
the assay may be enhanced by using RNA (rather than DNA), in which the secondary structure is 

25 more sensitive to a change in sequence. In one embodiment the subject method utilizes 
heteroduplex analysis to separate double stranded heteroduplex molecules on the basis of 
changes in electrophoretic mobility. See, e.g.> Keen, et aL, 1991. Trends Genet. 7: 5. 

In yet another embodiment, the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing gradient gel 

30 electrophoresis (DGGE). See, &g. t Myers, et al., 1 985. Nature 313: 495. When DGGE is used 
as the method of analysis. DNA will be modified to instire that it does not completely denature, 
for example by adding a GC clamp of approximately 40 bp of high-melting GC-rioh DNA by 
PCR. In a further embodiment a temperature gradient is used in place of a denaturing gradient 
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to identify differences in the mobility of control and sample DNA. See, e.g. Rosenbaum and 
Reissner, L987. Biophys. Ckem. 265: 12753. 

Examples of other techniques for detecting point mutations include, but are not limited 
to, selective oligonucleotide hybridization, selective amplification, or selective primer extension, 
5 For example, oligonucleotide primers may be prepared in which the known mutation is placed 
centrally and then hybridi zed to target DN A under conditions that permit hybridization only if a 
perfect match is found See, Saiki.e/o/., 1 986. Natuw 324: 163; Saikuef a/., 1989. Proc. 
Natl Acad. Sci. USA 86: 6230. Such allele specific oligonucleotides are hybridized to PCR 
amplified target DNA or a number of different mutations when the oligonucleotides are attached 

10 to the hybridizing membrane and hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology that depends on selective PCR 
amplification may be used in conjunction with the instant invention. Oligonucleotides used as 
primers for specific amplification may carry the mutation of interest in the center of the molecule 
(so that amplification depends on differential hybridization; see, e.g., Gibbs, ei aL> 1989. Nucl. 

15 Acids Res, 17: 2437-2448) or at the extreme 3'-tenninus of one primer where, under appropriate 
conditions, mismatch can prevent, or reduce polymerase extension (see, e.g, Prossner, 1993. 
Tibiech. 1 1 : 238). In addition it may be desirable to introduce a novel restriction site in the 
region of the mutation to create cleavage-based detection. See, e.g., Gasparirri, et aL, 1992. Mol 
Cell Probes 6:1. It is anticipated that in certain embodiments amplification may also be 

20 performed using Taq ligase for amplification. See, e.g., Barany, 1991. Proc. Natl. Acad. Set. 

USA 88: 189. In such cases, ligation will occur only if there is a perfect match at the 34ermiaus 
of the 5' sequence, making it possible to detect the presence of a known mutation at a specific 
site by looking for the presence or absence of amplification. 

The methods described herein may be performed, tor example, by utilizing pre-packaged 

25 diagnostic kits comprising at least one probe nucleic acid or antibody reagent described herein, 
which may be conveniently used, e.g., in clinical settings to diagnose patients exhibiting 
symptoms or family history of a disease or illness involving an MOLX gene. 

Furthermore, any cell type or tissue, preferably peripheral blood leukocytes, in which 
MOLX is expressed may be utilized in the prognostic assays described herein. However, any 

30 biological sample containing nucleated cells may be used, including, for example, buccal 
mucosa] cells. 
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Ph arm aco genomics 

Agents, or modulators that have a stimulatory or inhibitory effect on MOLX activity 
(e.g., MOLX gene expression), as identified by a screening assay described herein can be 
5 administered to individuals to treat (prophylactkalty or therapeutically) disorders (The disorders 
include metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer-associated 
cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, 
immune disorders, and hematopoietic disorders, and the various dyslipidemias, metabolic 
disturbances associated with obesity* the metabolic syndrome X and wasting disorders 

10 associated with chronic diseases and various cancers,) In conjunction with such treatment, the 
pharmacogenomics (ie., the study of the relationship between an individual's genotype and that 
individual's response to a foreign compound or drug) of the individual may be considered. 
Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic failure by 
altering the relation between dose and blood concentration of the pharmacologically active drug. 

1 5 Thus, the pharmacogenomics of the individual permits the selection of effective agents (e.g. , 
drugs) for prophylactic or therapeutic treatments based on a consideration of the individual's 
genotype. Such pharmacogenomics can further be used to determine appropriate dosages and 
therapeutic regimens. Accordingly, the activity of MOLX protein, expression of MOLX nucleic 
acid, or mutation content of MOLX genes in an individual can be determined to thereby select 

20 appropriate agent(s) for therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the response 
to dnigs due to altered drug disposition and abnormal action in affected persons. See e.g. 9 
Eichelbaum, 1996. Clbu Exp. Pharmacol Physiol, 23: 983-985; Linder, 1997. Clin. Chcrn., 43: 
254-266. In general, two types of pharmacogenetic conditions can be differentiated Genetic 

25 conditions transmitted as a single factor altering the way drugs act on the body (altered drug 

action) or genetic conditions transmitted as single factors altering the way the body acts on drugs 
(altered drug metabolism). These pharmacogenetic conditions can occur either as rare defects or 
as polymorphisms. For example, glucose-6-phosphaie dehydrogenase (GtfPD) deficiency is a 
common inherited enzymopathy in which the main clinical complication is hemolysis alter 

30 ingestion of oxidant drugs (anti-nmlarials, sulfonamides, analgesics, nitrofurans) and 
consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a major 
determinant of both the intensity and duration of drug action. The discovery of genetic 
polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 2) and 
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cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation as to why 
some patients do not obtain the expected drug effects or show exaggerated drug response and 
serious toxicity after taking the standard and safe dose of a drug. These polymorphisms arc 
expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor 
5 melahulizer (PM). The prevalence of PM is different among different populations. For example, 
the gene coding for CYP2D6 is highly polymorphic and several mutations have been identified 
in PM, which all lead to the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and 
CYP2C19 quite frequently experience exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active therapeutic moiety, PM show no therapeutic 

10 response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6-fonned 
metabolite morphine. At the other extreme arc the so called ultra-rapid metabolizers who do not 
respond to standard doses. Recently, the molecular basis of ultra-rapid metabolism has been 
identified to be due to CYP2D6 gene amplification, 

Thus, flie activity of MOLX protein, expression of MOLX nucleic acid, ox mutation 

15 content of MOLX genes in an individual can be determined to thereby select appropriate agent(s) 
foT therapeutic or prophylactic treatment of the individual. In addition, pharmacogenetic studies 
can be used to apply genotyping of polymorphic alleles encoding drug-metabolizing enzymes to 
the identification of an individual's drug responsiveness phenotype. This knowledge, when 
applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus 

20 enhance therapeutic or prophylactic efficiency when treating a subject with an MOLX 

modulator, such as a modulator identified by one of the exemplary screening assays described 
herein. 

Monitoring of Effects During Clinical Trials 

25 Monitoring the influence of agents (eg. , drugs, compounds) on the expression or activity 

of MOLX (e.g. , the ability to modulate aberrant ceil proliferation and/or differentiation) can be 
applied not only in basic drug screening, but also in clinical trials. For example, the 
effectiveness of an agent determined by a screening assay as described herein to increase MOLX 
gene expression, protein levels, or upregulate MOLX activity, can be monitored in clinical trails 

30 of subjects exhibiting decreased MOLX gene expression, protein levels, or downregulated 

MOLX activity. Alternatively, the effectiveness of an agent determined by a screening assay to 
decrease MOLX gene expression, protein levels, or downiegulate MOLX activity, can be 
monitored m clinical tails of subjects exhibiting increased MOLX gene expression, protein 
levels, or upregulated MOLX activity. In such clinical trials, the expression or activity of 
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MOLX and, preferably, other genes that have been implicated in, for example, a cellular 
proliferation or immune disorder can be used as a "read out" or markers of the immune 
responsiveness of a particular cell. 

By way of example, and not of limitation, genes, including MOLX, that are modulated in 
5 cells by treatment with an agent compound, drug or small molecule) feat modulates MOLX 
activity (e.g.> identified in a screening assay as described herein) can be identified. Thus, to 
study the effect of agents on cellular proliferation disorders, for example, in a clinical trial, cells 
can be isolated and RNA prepared and analyzed for the levels of expression of MOLX and other 
genes implicated in the disorder. The levels of gene expression a gene expression pattern) 

10 can be quantified by Northern blot analysis or RT-PCR, as described herein, or alternatively by 
measuring the amount of protein produced, by one of the methods as described herein, or by 
measuring the levels of activity of MOLX or other genes. In this manner, the gene expression 
pattern can serve as a marker, indicative of the physiological response of the cells to the agent 
Accordingly, this response state may be determined before, and at various points during, 

15 treatment of the individual with the agent 

In one embodiment, the invention provides a method for monitoring the effectiveness of 
treatment of a subject with an agent (e.g., an agonist, antagonist, protein, peptide, 
peptidomimetic, nucleic acid, small molecule, or other drug candidate identified by the screening 
assays described herein) comprising the steps of (i) obtaining a pre-admmistration sample from a 

20 subject prior to administration of the agent 0*0 detecting the level of expression of an MOLX 
protein, mRNA, or genomic DNA in the preadministration sample; (m) obtaining one or more 
post-administration samples from the subject; (fv) detecting the level of expression or activity of 
the MOLX protein, mRNA, or genomic DNA in the post-administration samples; (v) comparing 
the level of expression or activity of the MOLX protein, mRNA, or genomic DNA in the 

25 pre-administration sample with the MOLX protein, mRNA, or genomic DNA in the post 

administration sample or samples; and (vr) altering the administration of the agent to the subject 
accordingly. For example, increased administration of the agent may be desirable to increase the 
expression or activity of MOLX to higher levels than detected, i.e., to increase the effectiveness 
of the agent, Alternatively, decreased administration of me agent may be desirable to decrease 

30 expression or activity of MOLX to lower levels than detected, r. & , to decrease the effectiveness 
of the agent. 
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Methods of Treatment 

The invention provides for both prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant 
MOLX expression or activity. The disorders include cardiomyopathy, atherosclerosis, 
5 hypertension, congenital heart defects, aortic stenosis, atrial septal defect (ASD), 

atrioventricular (A-V) canal defect ductus arteriosus, pulmonary stenosis, subaortic stenosis, 
ventricular septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma, obesity, . 
transplantation, adrenoleukodystrophy, congenital adrenal hyperplasia, prostate cancer, 
neoplasm; adenocarcinoma, lymphoma, uterus cancer, fertility, hemophilia, hypercoagulation, 
1 0 idiopathic thrombocytopenic purpura, immunodeficiencies, graft versus host disease, AIDS, 
bronchial asthma, Crohn's disease; multiple sclerosis, treatment of Albright Hereditary 
Osteodystrophy, and other diseases, disorders and conditions of the like. 
These methods of treatment will be discussed more lully, below. 

1 5 Disease and Disorders 

Diseases and disorders that are characterized by increased (relative to a subject not 
suffering from the disease or disorder) levels or biological activity may be treated with 
Therapeutics that antagonize (z.e., reduce or mhibit) activity. Therapeutics that antagurrize 
activity may be administered in a therapeutic or prophylactic manner. Therapeutics that may be 

20 utilized include, but are not limited to: (i) an aforementioned peptide, or ana2ogs, derivatives, 
fragments or homologs thereof; (»") antibodies to an aforementioned peptide; (HQ nucleic acids 
encoding an aforementioned peptide; (rv) administration of antisense nucleic acid and nucleic 
acids (hat are "dysfunctional" (Le\, due to a heterologous insertion within the coding sequences 
of coding sequences to an aforementioned peptide) that are utilized to "knockout" endoggenous 

25 function of an aforementioned peptide by homologous recombination {see, e.g., Capecchi, 1989. 
Scimce 244: 1288-1292); or (v) modulators ( inhibitors, agonists and antagonists, including 
additional peptide mimetic of the invention or antibodies specific to a peptide of the invention) 
that alter the interaction between an aforementioned peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject not 

30 suffering from the disease or disorder) levels or biological activity may be treated with 

Therapeutics that increase (L e., arc agonists to) activity. Therapeutics that uprcgulate activity 
may be administered m a therapeutic or prophyl actic manner. Therapeutics that may be utilised 
include, but are not limited to, an aforementioned peptide, or analogs, derivatives, fragments or 
homologs thereof; or an agonist that increases bioavailability. 
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Increased or decreased levels can be readily detected by quantifying peptide and/or RNA, 
by obtaining a patient tissue sample {e.g., from biopsy tissue) and assaying it hi vitro for RNA or 
peptide levels, structure and/or activity of the expressed peptides (or mRNAs of an 
aforementioned peptide). Methods that are well-known within the art include, but are not limited 
5 to, immunoassays (eg., by Western blot analysis, immunoprecipitation followed by sodium 
dodecyl sulfate (SDS) polyacrylamide gel electrophoresis, irnmunocytochemistry, etc.) and/or 
hybridization assays to delect expression of mRNAs {e.g., Northern assays, dot blots, in situ 
hybridization, and the like). 

10 Prophylactic Methods 

In one aspect, the invention provides a method for preventing, in a subject, a disease or 
condition associated with an aberrant MOLX expression or activity, by administering to the 
subject an agent that modulates MOLX expression or at least one MOLX activity. Subjects at 
risk for a disease that is caused or contributed to by aberrant MOLX expression or activity can be 

1 5 identified by, for example, any or a combination of diagnostic or prognosti c assays as described 
herein. Administration of a prophylactic agent can occur prior to the manifestation of symptoms 
characteristic of the MOLX aberrancy, such that a disease or disorder is prevented or, 
alternatively, delayed in its progression. Depending upon the type of MOLX aberrancy, for 
example, an MOLX agonist or MOLX antagonist agent can be used for treating the subject The 

20 appropriate agent can be determined based on screening assays described herein. The 
prophylactic methods of the invention are further discussed m the following subsections. 

• Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating MOLX expression or 
25 activity for therapeutic purposes. The modulatory method of the invention involves contacting a 
cell with an agent that modulates one or more of the activities of MOLX protein activity 
associated with the cell. An agent that modulates MOLX protein activity can be an agent as 
described herein, such as a nucleic acid or a protein, a naturally-occucring cognate ligand of an 
MOLX protein, a peptide, an MOLXpeptidommietic, or other small molecule, hi one 
30 embodiment, the agent stimulates one or more MOLX protein activity. Examples of such 

stimulatory agents include active MOLX protein and a nucleic acid molecule encoding MOLX 
that has been introduced into the cell. In another embodiment, the agent inhibits one or more 
MOLX protein activity. Examples of such inhibitory agents include antisense MOLX nucleic 
acid molecules and anti-MOLX antibodies. These modulatory methods can be performed in 
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vitro {e.g. , by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering 
the agent to a subject). As such, the invention provides methods of treating an individual 
afflicted with a disease or disorder characterized by aberrant expression or activity of an MOLX 
protein or nucleic acid molecule. In one embodiment, the method involves administering an 
5 agent (e.g. , an agent identified by a screening assay described herein), or combination of agents 
that modulates (e.g., np-xegulates or down-regulates) MOLX expression or activity. In another 
embodiment, the method involves administering an MOLX protein or nucleic acid molecule as 
therapy to compensate for reduced or aberrant MOLX expression or activity. 

Stimulation of MOLX activity is desirable in situations in which MOLX is abnormally 
1 0 downregulated and/or in which increased MOLX activity is likely to have a beneficial effect 
One example of such a situation is where a subject has a disorder characterized by aberrant cell 
proliferation and/or differentiation (e.g t , cancer or immune associated disorders). Another 
example of such a situation is where the subject has a gestational disease {e.g. 9 preclampsia). 

Determination of the Biological Effect of the Therapeutic 

15 In various embodiments of the invention, suitable in vitro or in vivo assays are performed 

to determine the effect of a specific Therapeutic and whether its administration is indicated for 
treatment of the affected tissue. 

In various specific embodiments, in vitro assays may be performed with representative 
cells of the type(s) involved in (he patients disorder, to determine if a given Therapeutic exerts 

20 the desired effect upon the cell type(s). Compounds for use in therapy may be tested in suitable 
animal model systems including, but not limited to rats, mice, chicken, cows, monkeys, rabbits, 
and the like, prior to testing in human subjects. Similarly, for in vivo testing, any of the animal 
model system known in the art may be used prior to administration to human subjects. 

Prophylactic and Therapeutic Uses of the Compositions of the Invention 

25 The MOLX nucleic acids and proteins of the invention are useful in potential 

prophylactic and therapeutic applications implicated in a variety of disorders including, but not 
limited to: metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer-associated 
cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, immune 
disorders, hematopoietic disorders, and the various dyslipidemias, metabolic disturbances 

30 associated with obesity, the metabolic syndrome X and wasting disorders associated with 

chronic diseases and various cancers. 

As an example, a cDNA encoding the MOLX protein of the invention may be useful in 

gene therapy, and the protein maybe useful when administered to a subject in need thereof By 
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way of non-limiting example, the compositions of the invention will have efficacy for treatment 
of patients suffering from: metabolic disorders, diabetes, obesity, infectious disease, anorexia, 
cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, 
Parkinson's Disorder, immune disorders, hematopoietic disorders, and the various dyslipidexnias. 
5 Both the novel nucleic acid encoding the MOLX protein, and the MOLX protein of the 

invention, or fragments thereof, may also be uscftl in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. A farther use could be 
as an anti-bacterial molecule (Le., some peptides have been found to possess anti-bacterial 
properties). These materials are further useful in the generation of antibodies which 
10 immunospecifically-bind to the novel substances of the invention for use in therapeutic or 
diagnostic methods. 

Examples 

Example 1: Quantitative expression analysis of clones in various cells and tissues 

The quantitative expression of various clones was assessed using microliter plates 

1 5 containing RNA samples from a variety of normal and pathology-derived cells, cell lines and 
tissues using real time quantitative PCR (RTQ PCR; TAQMAN®). RTQ PCR was performed 
on a Pcrkin-Elmcr Biosystcms ABI PRISM® 7700 Sequence Detection System Various 
collections of samples are assembled on the plates, and referred to as Panel 1 (containing cells 
and cell lines from normal and cancer sources), Panel 2 (containing samples derived from 

20 tissues, in particular from surgical samples, from normal and cancer sources), Panel 3 

(containing samples derived from a wide variety of cancer sources) and Panel 4 (containing cells 
and cell lines from normal cells and cells related to inflammatory conditions). 

First, the RNA samples were normalized to constitutively expressed genes such as (3 - 
actin and GAPDH. RNA (-50 ng total or~l ngpolyA+) was converted to cDNA using the 

25 TAQMAN® Reverse Transcription Reagents Kit (PE Biosystems, Foster City, CA; Catalog No. 
N808-0234) and random hexamers according to the manufacturer's protocol. Reactions were 
performed in 20 ul and incubated for 30 min. at 48°C. cDNA (5 ul) was then transferred to a 
separate plate for the TAQMAN® reaction using (3-actin and GAPDH TAQMAN® Assay 
Reagents (PE Biosystems; Catalog Nos. 43108S1E and 43108S4E, respectively) and 

30 TAQMAN® universal PCR Master Mk (PE Biosystems; Catalog No. 4304447) according to the 
manufacturer's protocol. Reactions were performed in 25 ul using the following parameters; 2 
min. at 50°C; 10 min. at 95°C; 15 sec. at 95°C/1 min. at 60°C (40 cycles). Results were reconled 
as CT values (cycle at which a given sample crosses a threshold level of fluorescence) using a 
log scale; with the difference in RNA concentration between a given sample and the sample with 
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the lowest CT value being represented as 2 to the power of delta CT. The percent relative 
expression is then obtained by taking the reciprocal of this RNA difference and multiplying by 
1 00. The average CT values obtained for 6-actin and GAPDH were used to normalize RNA 
samples. The RNA sample generating the highest CT value required no further diluting, while 
5 all other samples were diluted relative to this sample according to their (} -actin /GAPDH 
average CT values. 

Normalized RNA (5 ul) was converted to cDNA and analyzed via TAQMAN® using 
One Step RT-PCR Master Mix Reagents (PE Biosystems; Catalog No. 4309169) and gene- 
specific primers according to the manufacturer's instructions. Probes and primers were designed 

1 0 for each assay according to Perfcrn Elmer Biosystem's Primer Express Software package 

(version I lor Apple Computer's Macintosh Power PC) or a similar algorithm using the target 
sequence as input Default settings were used for reaction conditions and the following 
parameters were set before selecting primers: primer concentration = 250 nM, primer melting 
temperature (T m ) range = 58°-60° C, primer optimal Tm = 59° C, maximum primer difference = 

15 2° C, probe does not have 5 * G, probe T TO mustbe 10° C greater than primer Tm, amplicon size 75 
bp to 100 bp. The probes and primers selected (see below) were synthesized by Synthege'n 
(Houston, TX, USA). Probes were double purified by HPLC to remove uncoupled dye and 
evaluated by mass spectroscopy to verify coupling of reporter and quencher dyes to the 5* and 3 * 
ends of the probe, respectively. Their final concentrations were: forward and reverse primers, 

20 900 nM each, and probe, 200nM. 

PCR conditions: Normalized RNA from each tissue and each cell line was spotted in 
each well of a 96 well PCR plate (Perkin Elmer Biosystem s). PCR cocktails including two 
probes (a probe specific for the target clone and another gene-specific probe multiplexed with the 
target probe) were set up using IX TaqMan™ PCR Master Mix for the PE Biosystems 7700, 

25 with 5 mM MgC12, dNTPs (dA, G, C, U at 1:1:1 ;2 ratios), 0.25 U/ml AmpliTaq Gold™ (PE 
Biosystems), and 0.4 U/uJ RNase inhibitor, and 0.25 U/ul reverse transcriptase. Reverse 
transcription was performed at 48° C for 30 minutes followed by amplification/PCR cycles as 
follows: 95° C 10 min, then 40 cycles of 95° C for 1 5 seconds, 60° C for 1 minute. 

30 

In the results for Panel I , the following abbreviations arc used: 

ca. - carcinoma, 

* - established from metastasis, 

met = metastasis, 
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s cell var= small cell variant, 
non-s ~ non-sm =non-small, 
yquam = squamous, 
pL eff = pi effusion = pleural effusion, 
5 glio - glioma, 

astro = astrocytoma, and 
neuro = neuroblastoma, 

10 Panel 2 

Tlx plates for Panel 2 generally include 2 control wells and 94 test samples composed of 
RNA or cDN A isolated from human tissue procured by surgeons working in close cooperation 
with the National Cancer Institute' s Cooperative Human Tissue Network (CHTN) or the 

15 National Disease Research Initiative (NDR1). The tissues are derived from human malignancies 
and in cases where indicated many malignant tissues have '"matched margins" obtained from 
noncancerous tissue just adjacent to the tumor. These are termed normal adjacent tissues and are 
denoted "NAT" in the results below. The tumor tissue and the "matched margins" are evaluated 
by two independent pathologists (the surgical pathologists and again by a pathologists at NDRI 

20 or CHTN). This analysis provides a gross histopathological assessment of tumor differentiation 
grade. Moreover, most samples include the original surgical pathology report that provides 
information regarding the clinical stage of the patien t These matched margins are taken from Qie 
tissue surrounding (i.e. immediately proximal) to the zone of surgery (designated "NAT", for 
normal adjacent tissue, in fable RR). In addition, RNA and cDNA samples were obtained from 

25 various human tissues derived from autopsies performed on elderly people or sudden death 

victims (accidents, etc.). These tissue were ascertained to be free of disease and were purchased 
from various commercial sources such as Clontech (Palo Alto, CA), Research Genetics, and 
Iuvitrogcn. 

RNA integrity from all samples is controlled for quality by visual assessment of agarose 
30 gel electropherograms using 28 S and 1 8S ribosomal RNA staining intensity ratio as a guide (2: 1 
to 2.5: 1 28s : 1 8s) and the absence of low molecular weight RNA s that woul d he indicative of 
degradation products. Samples are controlled against genomic DNA contamination by RTQ 
PCR reactions ran in the absence of reverse transcriptase using probe and primer sets designed to 
amplify across the span of a single exon. 

35 
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Panel 4 

Panel 4 includes samples on a 96 well plate (2 control wells, 94 lest samples) composed 
of RNA (Panel 4i) or cDNA (Panel 4J) isolated from various human cell lines or tissues relaled 
5 to inflammatory conditions. Total RNA from control normal tissues such as colon and lung 
(Stratagene ,La Jolla, CA) and thymus and kidney (Clontech) were employed. Total RNA from 
liver tissue from cirrhosis patients and kidney from lupus patients was obtained from BioChain 
(Biochain Institute, Inc., Hayward, CA). Intestinal tissue for RNA prerparation from patients 
diagnosed as having Crohn's disease and ulcerative colitis was obtained from the National 
10 Disease Research Interchange (NDRI) (Philadelphia, PA). 

Astrocytes, lung fibroblasts, dermal fibroblasts, coronary arteiy smooth muscle cells, 
small airway epithelium, bronchial epithelium, microvascular dermal endothelial cells, 
microvascular lung endothelial cells, human pulmonary aortic endothelial cells, human umbilical 
vein endothelial cells were all purchased from Clonetics (Walkersville, MD) and grown in the 

1 5 media supplied for these cell types by Clonetics. These primary cell types were activated with 
various cytokines or combinations of cytokines for 6 and/or 12-14 hours, as indicated. The 
following cytokines were used; 11 .-1 beta at approximately 1 -5 ng/ml, TNF alpha at 
approximately 5-10 ng/ml, IFN gamma at approximately 20-50 ng/ml, H.-4 at approximately 5- 
10 ng/ml, IL-9 at approximately 5-10 ng/mJ, IL-13 at approximately 5-10 ng/ml Endothelial 

20 cells were sometimes starved for various times by cnltuie in the basal media from Clonetics with 
0.1% serum. 

Mononuclear cells were prepared from blood of employees at CiiraGen Corporation, 
using FicolL LAK cells were prepared from these cells by culture in DMEM 5% FCS 
(Hyclone), 100 jjM non essential amino acids (Gibco/Life Technologies, Rockville, MD), 1 mM 

25 sodium pyruvate (Gibco), mercaptoethanol 5.5 x 1 0' 5 M (Gibco), and 10 mM Hepes (Gibco) and 
Interleukin 2 for 4-6 days* Cells were then either activated with 10-20 ng/ml PMA and 1-2 
ug/ml ionomycin, IL-12 at 5-10 ng/ml, IFN gamma at 20-50 ng/ml and IL-1S at 5-10 ng/ml for 6 
hours. In some cases, mononuclear cells were cultured for 4-5 days in DMEM 5% FCS 
(Hyclone), 100 yM non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), 

30 mercaptoethanol 5.5 x 10" 5 M (Gibco), and 10 mM Hepcs (Gibco) with PHA 

(phytohemagglntinm) or PWM (pokeweed mitogen) at approximately 5 {ig/mL Samples were 
taken at 24, 48 and 72 hours for RNA preparation. MLR (mixed lymphocyte reaction) samples 
were obtained by taking blood from two donors, isolating the mononuclear cells using Ficoll and 
mixing die isolated mononuclear cells 1 : 1 at a final concentration of approximately 2xl0 6 
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cells/ml in DMEM 5% FCS (Hyclone), 100 fiM non essential amino acids (Gibco), 1 rnM 
sodium pyruvate (Gibco), mercaptoethanol (55 x 10 s M) (Gibco), and 10 mM Hepes (Gibco). 
The MLR was cultured and samples taken at various time points ranging from 1- 7 days for RNA 
preparation. 

5 Monocytes were isolated from mononuclear cells using CD! 4 Miltenyi Beads, +ve VS 

selection columns and a Vario Magnet according to the manufacturer's instructions. Monocytes 
were differentiated into dendritic cells by culture in DMEM 5% fetal calf serum (FCS) 
(Hyclone, Logan, UT), 100 jiM non essential amino acids (Gibco), 1 mM sodium pyruvate 
• (Gibco), iuercaptoethanol 5.5 x 10' 5 M (Gibco), and 10 mM Hepes (Gibco), 50 ng/ml GMCSF 

1 0 and 5 ng/ml IL-4 for 5-7 days. Macrophages were prepared by culture of monocytes for 5-7 days 
in DMEM 5% FCS (Hyclone), 100 jiM non essential amino acids (Gibco), 1 mM sodium 
pyruvate (Gibco), mercaptoethanol 5.5 x 10' 5 M (Gibco), 10 mM Hepes (Gibco) and 10% AB 
Human Serum or MCSF at approximately 50 ng/ml. Monocytes, macrophages and dendritic cells 
were stimulated for 6 and 12-14 hours with lipopolysaccharide (LPS) at 100 ng/ml. Dendritic 

15 cells were also stimulated with anti-CD40 monoclonal antibody (Pharmmgen) at 10 fag/ml for 6 
and 12-14 hours. 

CD4 lymphocytic CD8 lymphocytes and NK cells were also isolated from mononuclear 
cells using CD4, CD8 and CD56 Miltenyi beads, positive VS selection columns and a Vario 
Magnet according to the manufacturer's instructions. CD45RA and CD45RO CD4 lymphocytes 

20 were isolated by depleting mononuclear cells of CDS, CD56, CD14 and CD1 9 cells using CDS, 
CD56, CD14 and CD 19 Miltenyi beads and +ve selection. Then CD45RO beads were used to 
isolate the CD45RO CD4 lymphocytes with the remaining cells being CD45RA CD4 
lymphocytes. CD45RA CD4, CD45RO CD4 and CDS lymphocytes were placed in DMEM 5% 
FCS (Hyclone), 1 00 pM non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), 

25 mercaptoethanol 5.5 x 10* 5 M (Gibco), and 10 mM Hepes (Gibco) and plated at 10 6 cells/ml onto 
Falcon 6 well tissue culture plates that had been coated overnight with 0.5 ng/ml antj -CD28 
(Pharmingen) and 3 ug/inl anti-CD3 (OKT3, ATCC) in PBS. After 6 and 24 hours, the cells 
were harvested for RNA preparation. To prepare chronically activated CD8 lymphocy fces, we 
activated (he isolated CDS lymphocytes for 4 days on anli-CD28 and anti-CD3 coated plates and 

30 then harvested the cells and expanded them in DMEM 5% FCS (Hyclone), 100 non essential 
amino acids (Gibco), 1 mM sodium pyruvate (Gibco) , mercaptoethanol 5.5 x 10" 5 M (Gibco), 
and 10 mM Hepes (Gibco) and 1L-2. The expanded CDS cells were then activated again with 
plate bound anti-CD3 and anti-CD28 for 4 days and expanded as before. RNA was isolated 6 
and 24 hours after the second activation and after 4 days of the second expansion culture. The 
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isolated NK cells were cultured in DMEM 5% FCS (Hyclone), 1 00 ]xM non essential amino 
acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10" 5 M (Gibco), and 10 
mM Hepes (Gibco) and IL-2 for 4-6 days before RNA was prepared 

To obtain B ceils, tonsils were procured from NDRI The tonsil was cut up with sterile 
5 dissecting scissors and then passed through a sieve. Tonsil cells were then spun down and 
resirpended at 10 6 cells/ml in DMEM 5% FCS (Hyclone), 100 ,uM non essential amino adds 
(Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10' 5 M (Gibco), and 10 mM 
Hepes (Gibco). To activate the cells, we used PWM at 5 jig/ml or anti-CD40 (Pharmingen) at 
approximately 10 jig/ml and TL-4 at 5-1 0 ng/niL Cells were harvested for RNA preparation at 
10 24,48 and 72 hours. 

To prepare the primary and secondary Thl/Th2 and Trl cells, six-well Falcon plates were 
coated overnight with 10 ^g/ml anti-CD28 (Pharmingen) and 2 n&'ml OKT3 (ATCC), and then 
washed twice with PBS. Umbilical cord blood CD4 lymphocytes (Poietic Systems, German 

Town, MD) were cultured at 10 -1 0 cells/ml in DMEM 5% FCS (Hyclone), 100 pM non 
15 essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10' 5 M 
(Gibco), 10 mM Hepes (Gibco) and IL-2 (4 ng/ml). IL-12(5ng/ml)andanti-n>*(1 ng/ml) 
were used to direct to Thl, while IL-4 (5 ng/ml) and anti-IFN gamma (3 Dg/ml) were used to 
direct to Th2 and IL-10 at 5 ng/ml was used to direct to Trl. After 4-5 days, the activated Thl, 
Th2 and Trl lymphocytes were washed once in DMBM and expanded for 4-7 days in DMEM 
20 5% FCS (Hyclone), 100 pM non essential amino acids (Gibco).. 1 mM sodium pyruvate (Gibco), 
mercaptoethanol 5.5 x 10" 5 M (Gibco), 10 rnM Hepes (Gibco) and IL-2 (1 ng/ml). Following 
this, ths activated Thl, Th2 and Trl lymphocytes were re-stimulated for 5 days with anti- 
CD28/OKT3 and cytokines as described above, but with the addition of anti-CD95L (I Dg/ml) 
to prevent apoptosis. After 4-5 days, the Thl, Th2 and Trl lymphocytes were washed and then 
25 expanded again with IL-2 for 4-7 days. Activated Thl and Th2 lymphocytes were maintained in 
this way for a maximum of three cycles. RNA was prepared from primary and secondary Thl, 
Th2 and Trl alter 6 and 24 hours following the second and third activations with plate bound 
anti-CD3 and anti-CD28 mAbs and 4 days into the second and third expansion cultures in 
hiterleukin2. 

30 The following leukocyte cells lines were obtained from the ATCC: Ramos, EOL-l, KU- 

812. EOL cells were further differentiated by culture in 0.1 inM dbcAMP at 5 xlO 5 cells/ml for 
8 days, changing the media every 3 days and adjusting the cell concentration to 5 xlO 5 cells/mL 
For the culture of these cells, we used DMEM or RPMI (as recommended by the ATCC), with 
the addition of 5% FCS (Hyclone), 100 yM non essential amino acids (Gibco), 1 mM sodium 
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pyruvate (Gibco), mercaptoethanol 5.5 x 10' 5 M (Gibco), 10 mM Hepes (Gibco). RNA was 
either prepared &om resting cells or cells activated with PMA at 10 ng/ml and ionomycin at 1 
fig/ml for 6 and 14 hours. Keratinooyte line CCD106 and an airway epithelial tumor line NCI- 
H292 were also obtained from the ATCC Both were cultured fa DMEM 5% FCS (Hyclone), 
1 00 uM non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoelhanol 5.5 
x 10* 5 M (Gibco), and 10 mM Hepes (Gibco). CCD1106 cells were activated for 6 and 14 hours 
with approximately 5 ngftnl TNF alpha and 1 ng/ml IL-1 beta, while NCI-H292 cells were 
activated for 6 and L4 hours with the following cytokines: 5 ng/ml 1L-4, 5 ng/ml IL-9, 5 ng/ml 
IL-13 and 25 ng/ml IFN gamma. 

For these cell Kncs and blood cells, RNA was prepared by lysfag approximately 10 7 
cells/ml using Trizol (Gibco BRL). Briefly, 1/10 volume of bromochloropropane (Molecular 
Research Corporation) was added to the RNA sample, vortexed and after 1 0 minutes at room 
temperature, the tubes were spun at 14,000 rpin in a Sorvall 3S34 rotor. The aqueous phase was 
removed and placed in a 15 ml Falcon Tube. An equal volume of tsopropanol was added and left 
at -20 degrees C overnight. The precipitated RNA was spun down at 9,000 rpm for 15 min in a 
Sorvall SS34 rotor and washed in 70% ethanol. The pellet was redissolved in 300 ul of RNAse- 
finee water and 35 ul buffer (Promega) 5 ul DTT, 7 ul RNAsin and 8 pi DNAse were added. The 
tube was incubated at 37 degrees C tor 30 minutes to remove contaminating genomic DNA, 
extracted once with phenol chloroform and re-precipitated with 1/10 volume of 3 M sodium 
acetate and 2 volumes of 100% eflianoL The RNA was spun down and placed in RNAse free 
water. RNA was stored at -80 degrees C. 

IVfOLI 

Expression of gene GMJ799G0178 was assessed using the primer-probe set Agl605, 
described in Table 12. Results of the RTQ-PCR runs are shown in Tables 13 and 14. 



Table 12. Probe Name: Agl605 



Primers 


Sequences 


Tm 


Length 


Start 
Position 


SEQID 
NO. 


Forward 


5'-CCTGAGCTACAACAACATCATG-3' 


58.3 


22 


333 


76 


Probe 


FAM.-5- 

CCTCATATCCCTGTCCC11CAGCCATA- 
3-TAMRA 


69 


26 


378 


77 


Reverse _ 


S'-GCAGAGTCTAGCATCAGGATGTXJ' 


58.6 


22 


407 


78 



148 



WO 01/81578 



PCTATS01/13578 



Table 13. Panels I3D and 2D 



PANEL UD 




PANEL 2D 






Tissue Name 


Rel. Expr., 
% 

1.3dx4tm54 

UZI agIOLF3 

bl 


Tissue Name 


Rel. Expr., 

70 

2Dtm2728f_ 
agi ui/j 


Rel. Expr., 
% 

2dx4tm4732 
r ag i iiu.> a 
2 


Liver 

adenocarcinoma 


1.9 


Normal Colon GENPAK 
061003 


70.2 


70.3 


Pancreas 


0 


83219 CC Well to Mod Diff 
(OD03866) 


8.5 


63 


Pancreatic ca, 
CAPAN2 


17.6 


83220 CC NAT (OD03866) 


8.1 


9.4 


Adrenal gland 


1.3 


83221 CC Gr.2 rectosigmoid 
(OD03868) 


4.5 


12.1 


Thyroid 


5.6 


83222 CCNAT(OD03868) 


1.6 


6.3 


Salivary gland 


8.9 


83235 CC Mod Diff 


17.9 


19.7 


Pituitary gland 


1.7 


83236 CC NAT (ODO3920) 


21.5 


11.6 


Brain (fetal) 


21.2 


83237 CC Gr2 ascend colon 
(OD03921) 


18 


34.4 


Brain (whole) 


30.5 


83238 CC NAT (OD03921) 


8.9 


9.8 


Brain 

(amygdala) 


6.6 


83241 CC from Partial 
Hepatectomy {VUU43W) 


7.1 


4.9 


Brain 

(cerebellum) 


14.6 


83242 Liver NAT (ODO4309) 


17.1 


6.9 


Brain 

(hippocampus) 


8.7 


87472 Colon mets to lung 


6.7 


11.4 


Brain 

(substantia 

nigra) 


93 


87473 Lung NAT (OD04451- 
02) 


0.8 


23 


Brain 
(thalamus) 


4.8 


Normal Prostate Clontech A+ 
6546-1 


.17.8 


52.9 


Cerebral Lortex 


2.6 


84140 Prostate Cancer 
(OD04410) 


18.6 


23 


Spinal cord 


1 * 7 


Q/r r A 1 Brric+nfn VAT 

(OD04410) 


O I 

y.l 


LL 


CMS ca. 

(glio/astro) 

U87-MG 


0 


87073 Prostate Cancer 
(OD04720-01) 


20 


14.5 


CMS ca. 
(glio/astro) U- 
118-MG 


1.5 


87074 Prostate NAT 
(OD04720-02) 


21.3 


38.1 


CMS ca. (astro) 
SW1783 


0 


Normal Lung GfiNPAK 
061010 


69.7 


85.9 
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CNSca.* 
(neuro; met ) 

OTT XT AO 


1.5 


83239 Lung Met to Muscle 
(OD04286) 


11.1 


6.3 


CNS ca. (astro) 

or -53V 


0 


83240 Muscle NAT 

(UDU4ZoOj 


15.1 


23.3 


CNS ca, (astro) 


0 


84136 Lung Malignant Cancer 


30.6 


30.8 


CNSca. (glio) 


9.7 


84137 Lung NAT (OD03126) 


8 


16.3 


CNS ca, (glio) 
U251 


4.8 


84871 Lung Cancer 
(OD04404) 


41.8 


45.2 


CNS ca, (glio) 
SF-295 


7.7 


84872 Lung NAT (OD04404) 


15.4 


27.5 


Heart (fetal) 


0.9 


84875 Lung Cancer 
(OD04565) 


7.4 


8.5 


Heart 


6.1 


84876 Lung NAT (OD04565) 


5-3 


1.2 


Fetal Skeletal 


3.5 


85950 Lung Cancer 
(OD04237-01) 


77.9 


72.9 


Skeletal muscle 


100 


85970 Lung NAT (OD04237- 
02) 


14.3 


12.9 


jjunc nidiiu w 


11 R 


ojZjj vjcuiar jviei jviex to 
Liver (ODO4310) 


1 7 


0 


inymus 








£ 1 

D.I 


Spleen 


14.6 


84 1 3 9 Melanoma Mets to 
Lung(OD04321) 


1.3 


62 


Lymph node 


57.9 


84138 Lung NAT (OD04321) 


36.9 


' 22.4 


Colorectal 


3.4 


Normal Kidney GENPAK 
061008 


9.2 


192 


Stomach 


20.7 


83786 Kidney Ca, Nuclear 
grade 2 (OD04338) 


4.2 


6.9 


Small intestine 


33,5 


83787 Kidney NAT 
(OD04338) 




8.3 


Colon ca. 
SW480 


0,6 


83788 Kidney Ca Nuclear 
grade 1/2{OD04339) 


18,7 


11.3 


Colon ca,* 

(SW480 

met)SW620 


0 


83789 Kidney NAT 
(OD04339) 


4.9 


8.5 


Colon ca. 
HT29 


0 


83790 Kidney Ca, Clear cell 
type(OD04340) 


6.3 | 


14.8 


vjoion ca. 
HCT-116 


0 


io3791 Kidney MAI 
(OD04340) 


10.7 


f A C 

14.6 


Colon ca. 
CaCo-2 


2.2 


83792 Kidney Ca. Nuclear 
grade 3 (OD04348) 


6 


7.5 


83219 CC Well 

toModDiff 

COD03866) 


0 


83793 Kidney NAT 
(OD04348) 


14.5 


16.5 
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Colon ca. 
HCC-2998 


0 


87474 Kidney Cancer 
(OD04622-01) 


15.4 


25 


Gastric ca-* 
(liver met) NCI- 

MOT 


7.2 


87475 Kidney NAT 
(OD04622-03) 


2.3 


2.8 


Bladder 


2.5 


85973 Kidney Cancer 
(OD04450-01) 


3.6 


7.3 


Trachea 


4.9 


85974 Kidney NAT 
(OD0445(M)3) 


1.3 


13.3 


Kidney 


0 


Kidney Cancer Clontech 

Dll/^JTA/7 


7.3 


52 


Kidney (fetal) 


7 


Kidney NAT Clontech 
8120608 


1.1 


3.6 


Renal ca> 
786-0 


0 


Kidney Cancer ClonLech 
8120613 


2.8 


7.5 


Renal ca. 
A498 


1.5 


Kidney NAT Clontech 
8120614 


3.7 


8 


Renal ca. 

Ti"VT7 n A1 

RXF 393 


3 


Kidney Cancer Clontech 
9010320 


17.4 


14.1 


Renal ca. 
ACHN 


LI 


Kidney NAT Clontech 

AA1 f>-T> 1 

9010321 


16.4 


11.3 


Renal ca. 
UO-31 


0 


Normal Uterus GENPAK 
061018 


4.6 


52 


Renal ca. 
TK-10 


12 


Uterus Cancer GENPAK 
064011 


17.2 


32.4 


Liver 


0.6 


Normal Thyroid Clontech A+ 
6570-1 


17.3 


18.1 


Liver (fetal) 

Liver ca. 

(hcpatoblast) 

HepG2 


8.8 


Thyroid Cancer GENPAK 
064010 


6.5 


1.1 


24.6 


Thyroid Cancer 
INVTTROGEN A302 i 52 


1.7 


1.6 


Lung 


1.3 


Thyroid NAT INWTROGEN 
A302153 


9.2 


17.7 


Lung (fetal) 


15.8 


Normal Breast GENPAK 
061019 


21.3 


30.1 


Lung ca. (small 
cell) LX-I 


1.5 


84877 Breast Cancer 
(OD04d66) 


1.4 


1.5 


Lung ca. (small 
cell) NC1-H69 


4.2 


85975 Breast Cancer 

ffW\ A A C A A A 1 \ 

(OD04390-01) 


12.9 


10.1 


Lungca.(s.ceH 

™ r \ otrp 77 
var.j otw-i i 


1.4 


85976 Breast Cancer Mets 
fTYnna^Qft m\ 

IUJJUhjS'U-U.; ) 


100 


8S 


Lung ca. (large 
cell)NCI-H460 


0 


87070 Breast Cancer 
Metastasis (OD04655-05) 


32.8 


0 


Lung ca. (non- 
sm. cell)A549 


2.9 


GENPAK Brea&i Cancer 
064006 


25 


28.5 


Lung ca. (non- 
s.cell)NCI-H23 


9.5 


Breast Cancer Res. Gen. 1024 


51.8 


66.3 
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Lung ca (non- 
s.cb11) HOP-62 


3 A 


meast Lancer uiontccn 
9100266 




25.1 


Lung ca. (iioji- 
s.cl) NCI-ri5ZZ 


0 


Breast NAT Ctoiuech 9100265 


18.2 


3L7 


|Lung ca. 
(squam.) bW 
900 


4.7 


Breast Cancer EWITROGEN 


15 


21 


Lung ca, 
(squam.; NCl- 
H596 


0 


Breast NAT INVTTROGEN 
A2U9U734 


16.2 


12.3 


Mammary 
gland 


0 


Normal liver GENPAK 
061009 


5.6 


4.9 


Breast ca.*(pl. 
effusion) MCF- 

1 


0.7 


Liver Cancer GENPAK 
064003 


5.4 


11.8 


Breast ca.* 
(pLef) MDA- 
MB-231 


6.1 


Liver Cancer Research 
Genetics RNA 1025 


3.7 


3.6 


Breast ca.* (pi. 

effusion) 

T47D 


5.1 


Liver Cancer Research 
Genetics RNA 1026 


4.8 


5.5 


Breast ca, 
BT-549 


2.8 


Paired liver Cancer Tissue 
Research Genetics RNA 6004- 
T 


10.6 


7 


Breast ca. 
MDA-N 


0 


Paired Liver Tissue Research 
Genetics RNA 6004-N 


19.3 


28.4 


Ovary 


2.7 


Paired Liver Cancer Tissue 
Research Genetics RNA 6005- 

T 
1 


4.6 


6 


Ovarian ca. 

c\\7r* AD 1 


3.2 


Paired liver Tissue Research 
ueneucs kin a ouu j-in 


5 


0 


Ovarian ca. 

UYCAK-4 


3.7 


Normal Bladder GENPAK 


18.3 


11.5 


Ovarian ca. 
OVCAR-5 


2.5 


Bladder Cancer Research 
Genetics RNA 1023 


7.9 


21.4 


Ovarian ca 
OVCAR-8 


1.7 


ni_ jj„ 

Bladder Cancer 
INVITROGnNA302173 


33 


18.9 


Ovarian ca, 
IGROV-1 


0 


87071 Bladder Cancer 
(OD04718-01) 


11 


18.5 


Ovarian ca.* 
(ascites) SK- 
OV-3 


1.8 


87072 Bladder Normal 
Adjacent (OD04718-03) 


9.4 


9.7 


Uterus 


24.6 • 


Normal Ovary Res. Gen. 


11.6 


11.3 


Placenta 


2.8 


Ovarian Cancer GENPAK. 
064008 


23.3 


22.9 


Prostate 


2.5 


87492 Ovary Cancer 
(OD04768-07) 


80.1 


100 


Prostate ca.* 
(boue met)PC-3 


26.5 


87493 Ovary NAT 
(OD04768-08) 


3.8 


6.4 
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Testis 


u 


Normal Stomach GENPAK 
061017 


1 C C\ 

15.9 


17.1 


Melanoma 
HsG88(A).T 


ft 
U 


usmmu Cancer Clontecn 
9060358 


36.3 


45.0 


Melanoma* 
(met) 


0 


NAT Stomach Clontecn 
9060359 


44.4 


60.8 


Melanoma 
UACC-62 


12 


Gastric Cancer Clontech 
9060395 


13.8 


20.4 


Melanoma 
M14 


0 


NAT Stomach Clontech 
9060394 


78.5 


93 


ivieianoma 
LOXIMVI 


c\ £ 
u.o 


urasmc vauccr uioncecn 
9060397 


5 £ 
J.O 




Melanoma* 
(met) SK- 
MEL-5 


4.1 


NAT Stomach Clontech 
9060396 


25.5 


30.1 | 


Adipose 


2 


Gastric Cancer GENPAK 
064005 


35.1 


. 27.6 



Table 14. Panel 3D 



Tissue Name 


Rel. Expr., % 
3dtm5227f_agl605 


94905J^aoy_MeduIloblastoxaa/'Cerebelluin_sscDNA 


0.0 


94906_TB67 1 JVteduIloblastoin/Cerebellum^scDNA 


0.4 


94907_B283 Mcd_Mcdi]lIoblastomfl/Ccrcbcllum_sscDNA 


0.0 


94908_PFSK-l_Primitive 
Neuroectodermal/Cerebelhim sscDNA 


0.0 


94909_XF-49S_CNS_ssoDNA 


0.0 


94910_SNB-78„CNS/gKoma - s3cDNA 


0.0 


9491 !_S>-268_CNS/gliobIastoma - sscDNA 


0.0 


94912 T98G Glioblastoma sscENA 


0.0 


96776_SK-N-SH_Newroblastoma(metastasis)_sscDNA 


0.0 


949 13_SF-295_CNS/gliobIastoma_sscDNA 


02 


949 1 4_Cerebel lum_RscDNA 


1.9 


96777 jCerebellum_sscDNA 


3.4 


94916_NCI-H292_Mucoepidermoid lung carcinoma_sscDNA 


0.4 


94917_DMS-1 !4_Small cell lung cancer_sscDNA 


0.0 


94918_JDMS-79__Small ccU lung 
cancer/neuroendocrine sscDNA 


3.8 


94919_NCI-H146_Small cell lung 
cancer/neuroendocrine sscDNA 


0.0 


94920_NCI-H526_Small ceU lung 
cancer/neuroendocrine sscDNA 


14.6 


94921JNCI-N4 17_Small cell lung 
cancer/neuroendocrine sscDNA 


0.0 
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94923_NCI-HS2_SmaU cell tang 


0.2 


94924 NCI-H157 Sauamous cell Inns cancer 
(metastasis)_sscDNA 


0.0 


94925_NCI-H1 155_Large eel! lung 
cancer/neuroendocrine sscDNA 


0.0 


94926 JTCI-H1299_Large cell lung 
cancer/neuroendocrine sscDNA 


0.0 


94927 JNCI-H /27 JLnng caTcmoid_sscDNA 


0.5 


94928_NC1-UMC-1 lJLung carcinoid^sscDNA 


6.6 


94929_LX>l_Small cell lung cancer_sscDNA 


0.2 


94930_Colo-205_Colon cancer_sscDNA 


0.0 


9493I_KM1 2 ^Colon cancer_sscDNA 


0.0 


94932_KM20L2_Colun cancer^sscDNA 


0.0 


94933 NCI-H716 Colon cancer sscDNA 


0.0 


94935_SW-48_Colon adenocarcinomasscDNA 


0.1 


94936_SW1 1 16_Colon adenocaTcinoma_sscDMA 


0.0 


94937JLS 174T_Colon adenocarcinoma_sscDNA 


0.0 


9493S_SW-948_Colon adenocaicinomasscDNA 


0.0 


94939JS\V-4S0_Colon adencwarmnoma_sscDNA 


0.0 


94940 J^CI-SNU-5_Gastric cardnoma_sscDNA 


0.2 


94941_KATO ]JT_Gastric carcinoma_sscDNA 


0.4 


94943 NCi-SNU-16 Gastric carcinoma sscDNA 


0.0 


94944 NQ-SNU-1 Gastric carcinoma sscDNA 


0.0 


94946_RF-l_Gastric adcnocarcinoma_sscDNA 


15.9 


94947_RF -4 S_Gastric adenocarcinomasscDNA 


; _ — — 

18.7 


96778_MKN-45_Gastric carcinotnasscDNA 


0.0 


94949 NCT-N87 Gastric carcinoma sscDNA 


0.0 


94951 pVCAR-S_Ovariancarcinoma_sscDNA 


0.0 


94952_RL95-2_Uterine carcinoma_sscDNA 


0.0 


94953_HelaS3_Cervical adenocarcinoina_sscDNA 


0.0 


949^54 fa Slci fftrvica? firridftrmcvid cfwcATkcmna 
(iiietastasis)j3ScDNA 


n ? 1 


9*i9SS FS-2 Ovarian dftar cpW carciivwiia ^ciM YN~ A 




94957 Ratrios/6h stim "* Stimulated with Pft/fA/innomvcin 
6h sscDNA 


14 7 


94958 Ramos/1 4h stim Stimulated with PMA/ionomycin 
14h sscDNA 


22.7 


94962_MEG-01_Chronic myelogenous leukemia 
(megokaryoblasf) sscDNA 


0.5 


94963 JSajiBurkitt's t>TTiphoina_8ScDNA 


32.3 


94964_Daudi_Burldtfs lymphoma^sscDNA 


100.0 


94965JJ266J3-cell plasmacytoma/myeIonm_>scDNA 


0.3 


94968J^A46JBurkitf s lymphoma_sscDNA 


22.5 


p4970_RL_non-IIodgkin's B-cell lymphoma sscDNA 


29.5 
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94972 jMl_pre-B-cell lymphcTma/leukeniia_sscDNA 
94973 Jurkat T cell leukemia sscDNA 


23.5 
2.2 


94974 JTF- lErylhroleufcemia jsscDNA 


0.3 


94975_HUT 78 JT-cell lymphoma_sscDNA 


1.6 


94977 JJ937JKistiocytic lynipli0fl2a_5scDNA 


7.3 


94980 JGJ-8l2_Myelogenous leiikentia_S5cDNA 


0.1 


9498J_769-P_Clear cell renal caicinoma^sscDNA 


0.2 


949 8 3_C aki-2_Clear cell renal carcinomasscDNA 


0.1 


94984 SW839 Qear cell renal carcinoma sscDNA 


1.0 


94986_G401_Wilms T tiimnrjwcDNA 


0.0 


94987_IIs766T_Pancreatic carcinoma (LN raetastasis)_sscDNA 


0.1 


94988_CAPAN-1 J?ancreatic adenocarcinoma (liver 
metastasis) sscDNA 


0.0 


949S9_SU86.86_Pancrcatic carcinoma (liver 
inetaslabi s)_bscDN A 


0.0 


94990 BxPC-3 Pancreatic adenocarcinoma sscDNA 


0.1 


94991 HPAC Pancreatic adenocarcinoma sscDNA 


0.1 


94992_MIA PaCa-2_Pancreatic carcinomasscDNA 


0.0 


94993_CFPAO-l_Pancceatic ductal adenocarcinoma_sscDNA 


0.2 


94994_PANC-l_Pancreatic epithelioid ductal 
carcinoma sscDNA 


0.0 


94996_T24_Bladder carcinma (transitional cell)jsscDNA 


0.0 


94997_5637_Bladder carcinoma.sscDMA 


0.0 
0.0 


94998_HT-1 197JBladder carcinomajsscDNA 


94999_IJM-UC-3_Bladder carcinma (transitional celi)_sscDNA 


0.0 


95000_A204_Rhabdomyosarcoma_sscDNA 


0.0 


95001_ET-1080_Fibrosarcoma_sscDNA 


0.0 


95002 JvlG-63_Osteosarcoma (bone)_sscDNA 


0.1 


95003_SK-LMS-l_Leiomyosarcoma (vnlva)_sscDNA 


0.2 


95004JSJ RH3 O Rhabdomyosaicoma (met to bone 
marrow)_sscDNA 


0.0 


95005_A431_Epi dermoid carcinoma_RscDNA 


0.0 


95007 WM266-4 Melanoma sscDNA 


0.0 


9501 0_DU 145_Prostate carcinoma (brain metastasis)_sscDNA 


0.0 


950 12_MD A-MB-46 8_Breast adenocarcinoma_sscDNA 


0.0 


95013_SCC-4_Squamous cell carcinoma of longue_sscDNA 


o.o . 


95014_SCC-9_Squamous cell carcinoma of tongue_sscDNA 


03 | 


95015_SCC-l5_Squamous cell carcinoma of tongue sscDNA 


0.0 


95017_CAL 27_Squamous cell carcinoma of tongue_sscDNA 


0.0 



The RTQ-PCR analysis (Table 13 and 14) reveals that MOL1 is predominantly expressed 
in cell lines derived from lymphoma and leukemia, specifically BuTkitts lymphoma in panel 3D. 
This result is supported by the presence of GenBank ESTs coming from T cells from T cell 
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leukemia (see Unigcnc http://wwjicbi.nlmji^ 

Recent report indicates that this receptdr nomally aids the immune cells to sense the presence of 
unmethylated CpG dinucleotides (Hemmi H, Tafojuchi O, Kawai T, Kaisho T, Sato S, Sanjo H, 
Matsumoto M, Hoshiuo K, Wagner H, Takeda K, Akira S. A Toll-like receptor recognizes 
5 bacterial USA. Nature 2000 Dec 7;4O8(6813):740-5) and to induce proliferation of splcnocytes, 
infkmmatory cytokine production from macrophages and maturation of dendritic cells. The 
signaling pathway mediated by toll-like receptor 9 is through the activation of NF-kB. There is 
evidence that NF-kappaB activity is necessary for survival of lymphoma and leukemia cells 
(Constitutive activation of NF-kappaB in primary adult T-cell leukemia cells. Mori N 3 Fujii M, 

10 Ikeda S, Yainada Y, Tomonaga M ? BaltaTd DW S Yamamoto N. Blood 1999 Apr l;93(7):2360-8; 
Inhibition of NF-kappaB induces apoptosis of KSHV-infected primary effusion lymphoma cells. 
Keller SA, Schattner EJ, CesarmanE. Blood, I October 2000, 96, No. 7, pp. 2537-2542). 
Overexpression of toll-like receptor 9 by lymphoma and leukemia cells is likely to mediate 
ligand-independent signaling, affecting the normal processes of activation, proliferation and 

15 tumorogenesis. Therefore the protein encoded (GMJ79960178) may serve as a potential marker 
for lymphoma and leukemia cells. In addition, human monoclonal antibodies directed against 
this protein could be therapeutics for the treatment of lymphoma and leukemia. 

MOL2 

20 Expression of gene MOL2 was assessed using the primer-probe set Ag743 , described in 

Table 1 5. Results of the RTQ-PCR runs are shown in Tabic 16. 



Table IS* Probe Name: Ag743 



Primers 


Sequences 


Tm 


Lengt 
h 


Stan 
Position 


"SEQID 
NO: 


Forward 


5'-ATGTCTTGGTGGATGCAGAA-3' 


59.1 


20 


1304 


79 


Probe 


TET-5'- 

CGGACTATAGCATTTCTAAGCGCCTCG- 
3-TAMRA 


69.2 


27 


1340 


80 


Reverse 


5'jCACATCCTCCTTGCAAATGT-3' 


58.6 


20 i 1370 


SI 



25 " 

Table 16. Panels 13J> and4D 



PANEL 1.3 D 




PANEL 4D 
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[Tissue Name 

i 


Rel. Expr,, % 
L.3dx4lm5o04t ag743 
al 


Tissue Name 


Rel. Expr,, % 
4Dtm2477t a£7 
43 


Liver 

.adenocarcino 
ma 


If A 

36.4 


93 /68J5econdary Tbl_anti- 
CD28/anti-CD3 


27.2 


[Pancreas 


4.6 


93769_Secondary Ih2_anti- 
LJ/2o/anii-CD3 


19.6 


Ipancreatic ca. 
CAP AN 2 


16.6 


93770_Secondary Trljmti- 
CD28/anti-CD3 


23.2 


Adrenal gland 


6.5 


93573_Secondary Th ^resting 
day 4-6 m IL-2 


7.2 


Thyroid 


5.2 


93572_Secondary Th2jresting 
day 4-6 in IL-2 


7.1 


Salivary gland 


12.0 


93571_Secondary TrL_resting 
day 4-6 in IL-2 


11.9 


Pituitary gland 


22.7 


93568 jprimaiy Thljmti- 
CD28/anti-CD3 


43.2 


Brain (fetal) 
Brain (whole) 


8.4 
38.6~ 


93569 jrimaty Th2_anti- 
CD28/anti-CD3 
93570_primaty Trl_anti- 
CD28/anti-CD3 


32.1 
~~ 50.0 


Brain 

(amygdala) 


24.8 


93565 jrimarylhl resting dy 
^6mIL-2 


43.8 


Brain 

(cerebellum) 


31.4 


93566_primary Th2 resting dy 
4-6 in IL-2 


16.4 


Brain 

(hippocampus) 


22.0 


93567_primary Trl_Testmg dy 
4-6 in IL-2 


22.4 


Brain 

(substantia 

nigra) 


n.s ~ 


93351_CD45RACD4 
lymphocyte anti-CD28/anti- 
CD3 


~ 23.0 


Brain 
(thalamus) 


30.3 


93352_CD45RO CD4 
lymphoc>1e anti-CD28/anti- 
CD3 


25.5 


Cerebral 
Cortex 


22.0 


9325I_CD8 Lymphocytesjinti- 
CD28/anti-CD3 


16.0 


Spinal cord 


20.6 


93353j±rorric CD8 
Lymphocytes 2ry jesting dy 4-6 
in IL-2 


15.0 


CNSca, 

(glio/astro) 

U87-MG 


28.1 


93574_chronic CD8 
Lymphocytes 2ry_activated 
CD3/CD28 


10.2 


CNS ca. 

(glio/astro) 

U-llS^MG 


27.9 


93354_CD4_none 


4.0 


CNS ca. 

(astro) 

SW1783 


31.1 


93252 Secondary 
Thl/Th2/Trljuiti-OD95 CH11 


9.6 


CNSca* 

/ \ 


193 


93 1 03_LAK ccllsjresting 


12.9 
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(neuro; met ) 
SK-N-AS 








CNSca. 

(astro) 

SF-539 


47.2 


93788JLAKcellsJL-2 


14.9 


CNSca. 

(astro) 

SNB-75 


10.2 


93787JLAK. celteJL-2+iH2 


12.5 


CNS ea. (glio) 
SNB-19 


24.3 


93789JLAK celk JL-2+1FN 
gamma 


22.2 


CNS ca. (glio) 
U251 


20.9 


93790J-AK cellsJL-2+IL-l8 


15.7 


CNS ca, (glio) 
SF-295 


10.7 


^3104_LAK 

cells_PMAyionomycin and IL- 1 8 


4.0 


Heart (fetal) 


2.0 


93578_NK Cells IL-2_re5ting 


12.0 


Heart 


6.5 


93109_Mixcd Lymphocyte 
ReacttonJTwo Way MLR 


9.8 


Fetal Skeletal 


0.7 


93 1 l0_Mixed Lymphocyte 
Reaction Two Way MLR 


14.3 


Skeletal 
muscle 


18.3 


93 1 1 l_Mixed Lymphocyte 
Reaction Two Way MLR 


11.7 


Bone marrow 


11.2 


93 1 12_Mononuclear Cells 
(PBMCs)_testing 


5.8 


Thymus 


7,8 


93113 Mononuclear Cells 
(PBMCs)_PWM 


51.8 


^pieen 


IJL 


i ivionoxiucicar v^ens 
(PBMCs)_PHA-L 


X /, D 


Lymph node 


/ .3 


yjz4y_Kamos ceiij^_none 


A 1 ' 
U.Z 


Colorectal 


28.4 


93250 JRamos (B 
cell)_ionomycin 


6.7 


Stomach 


6.5 


93349_B lymphocytes^PWM 


"106.0 


Small intestine 


10.0 


93350JB lymphoytes_CD40L 
andIL4 


23.7 


Colon ca. 
SW480 


23.2 


92665_liOL-l 

CEosinophil)_dbcAMP 

differentiated 


15.5 


Colon ca.* 

(SW480 

met)SW620 


12.3 


93248_EOL-l 

(EosinophilLdbcAMP/PMAion 
omycin 


10.4 


Colon ca. 

error* 


1.7 


93356J)endritic Cells_none 


15.3 


Colon ca. ; 
HCT-116 


13.1 


93355_Dendritic Cells JLPS 100 
ng/ml 


6.7 


Colon ca. ! 
CaCo-2 


12.5 


93775 Dendritic Cells anti- 
CD40 


16.7 


83219 CC 
Well to Mod ! 


10.4 


93774_Monocytes_resting 


14.9 
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L>iir 

(OD03866) 








uoion ca. 
HCC-2998 


1 1 2 


i /oJMonocytesJJro 5U 
ng/ml 


A f\ 

4.1) 


Lrastnc ca,* 
(liver met) 
mpt MR7 


1 H £ 
1 /.0 


9J 5 a l_Macrophiiges_rebiing 


in r 


Bladder 


17.6 


93 582_Macrophages_LPS 100 
ng/ml 


7.5 


Trachea 


7.6 


93098_HUVEC 
icjmo lnejidi j_jiune 


33.0 


Kidney 


5.7 


93099 JIUVEC 
(Endothelial) starved 


6(5.0 


raaney (letalj 


C A 

6.4 


ai i aa trrT\roi*» 
93 1 00_HU VEC 

(Endothelial). IL-lb 


29.3 


Renal ca* 
786-0 




ftOTIn TIT T\ TT3 

93/79_IIUVIiC 
(Endothelial) IFN gamma 


33,0 


Renal ca. 
A498 


23.6 


93102JtiUVEC 

(Endothelial)_TNF alpha + IFN 
gamma 


23.0 


|Renal ca. 
RXF393 


37.8 


93101JBUVEC 
(EndothelialjjTNP alpha + IL4 


19.9 


■Renal ca. 
kcHN 


1 O 1 

18.1 


93781_HUVEC 
(Endothelial) IL41 


22.4 


[Renal ca. 
UO-31 


26.1 


93583_Lung Microvascular 
Endothelial Cells none 


27.5 


Renal ca. 
TK-10 


10.9 


93584_Lung Microvascular 

■"* J xt_ 1* 1 X> 11 < fK,TT"> / 4 

Endothelial Cells JTNFa (4 
ng/tnl) and ILlb (l ng/ml) 


25.2 


Liver 


2.2 


92662_Microvasculor Dermal 
endothelium none 


51.4 


Liver (fetal) 


4.4 


92663_Microsvasular Dermal 

1 , , «T,, 1 ? . . . , . TVil 1 .. ^ A f \\ 

endotnelium_T Nra (4 myml) 
and Hlb (1 ng/ml) 


22.5 


tiver ca. 

i(hepatoblast) 

HepG2 


28.1 


93773_Bronchial 

epithelium TNFa (4 ng/ml) and 

ILlb (1 ng/ml) ** 


17.0 


Lung 


12.4 


93347_Smali Airway 
Epitheliumnone 


20.9 


Lung (fetal) 


5.2 


93348_SmaIl Airway j 
Epithelium_TNPa (4 ng'ml) and 
ILlb (1 ng/ml) 


84.7 


Lung ca. 
(small cell) 
LX-1 


10.8 


92668_Coronery Artery 
SMC_resting 


37.9 


lung ca. 
(small cell) 
NCI-H69 


20.2 


92669_Coronery Artery 

SMC TNFa (4 ng/ml) and ILlb 

(lng/mQ_ _ _ . 


26.8 


Lung ca. 


20.1 


93 1 07 jisirocy tes_resting 


47.6~~ 
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(s.cell var.) 

CUP 77 








Lung ca. 
(large 

H460 


10.7 


931 0 8_astrocytes_TNFa (4 
ng/ml)andILlb (1 ng'ml) 


20.0 


Lung ca. (noii- 

bill. \Xllt r\J~rJ 


5.5 


92666JKXJ-812 

^OaaUpUlJ J ICbLlii^ 


25.0 


Lung ca. (non- 
s.cell)NCI- 


13.2 


92667JCU-812 
(Basophil)_PMA/ionoycin 


31.9 


Lung ca (non- 
s.cell) HOP- 

£7 


18.3 


93579_COD1106 
(Keiatinocytes)_noiie 


47.0 


Lung ca. (non- 
s.cl)NCI- 


13.8 


93580_CCD1I06 

(Kei-atinocytes)TNPa and IFNg 
** 


7.2 


tungca. 
'(scjuain*) 
|SW900 


172 


93791 JLiver Cirrhosis 


4.2 


Lung ca. 
[NCHI596 


20.9 


93792_Lupus Kidney 


3.8 


Mammary 
^land- 


17.8 


93577_NCI-H292 


55.1 


Breast ca.* 
(pL efflision) 
jYlv>r-/ 


20.4 


93358_NCI-H292_IL~4 


68.8 


Breast ca.* 
(pi.eij IV11/A- 
MB-231 


22.5 


93360 J*a-H292JI^9 


59.5 


'Breast ca.* 
(pi. effusion) 
T47D 


15.5 




37^9" 


Breast ca. 


135 


93357_NCI-H292_IFN gamma 


Breast ca. 

MDA-N__ 

Ovary 


9.9 


93777_HPAEC_- 


32.3 


alpha 


71 7 


vyvarian. ca. 
OVCAR-3 




Q^7^A ^lArmal Qitmnfi T nnrr 
7jZj*t XXUilXloi XlUlUaU J-zUIlg 

Fibroblast none 


70 7 

zy. / 


Ovarian ca. 
OVCAR-4 


12.4 


91253 Normal Wuman T urn? 
fibroblast TNFa (4 ngfoil) and 
IL-lb (1 ng/inl) 


16 8 


Ovarian ca. 
OVCAR-5 




93257JSJormal Human Lung 
Fibroblast IL-4 


55.5 


Ovarian ca. 
OVCAR-8 


9.0 


93256_Noimal Human Lung 
Fibroblast IL-9 


34.9 
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lOvarian ca, 
IGROV-l 


22.6 


93255_Normal Human Lung 
Fibroblast IL-13 


36.9 


Ovarian ca * 
(ascites) SK- 
OV-3 


100.0 


93258_Nonnal Hmnaii Lung 
Fibroblast_IFN gamma 


58.6 


Uterus 


6.1 


93 106_Derroal Fibroblasts 
CCD1070_resting 


742 


fclanccnta 
1 


15.8 


93361_Dennal Fibroblasts 
CCD1 070_TNF alpha 4 ng/ml 


642 


prostate 


5.6 


93105 Dermal Fibroblasts 
CCD1070 IL-1 beta 1 ngfail 


27.7 


Prostate ca,* 
'(bone met)PC- 
3 


50.1 


f>3772 jkamal fibroblastJFN 
gamma 


24.0 

* 


Testis 


83 


93771_damal fibroblast JL-4 




Melanoma 
|Hs688(A),T 


12.6 


93259JBD Colitis 1** 


2.8 


(Melanoma* 
Hs688(B).T 


14.5 


93260 IBD Colitis 2 


1.1 


Melanoma 
UACC-62 


18.8 


93261 JBD Crobns 


2.1 


Melanoma 
M14 


19.7 


735010_Colon,normal 


14.3 


Melanoma 
LOX IMVI 


8.8 


735019_Lung_none 


21.5 


Melanoma* 
(met) SK- 
MEL-5 


11.0 


64028-l_Thymus_none 


27.0 


Adipose 


14.5 


64030-l_Kidney_none 


31.2 



MOL2 is widely expressed in tissues and cell lines represented m both panels 1 .3D and 
4D, with highest expression being in one ovarian cancer cell line (SK-OV-3). Thus, it could 
serve as a diagnostic marlcer for ovarian cancer. 



MOL3 

Expression of MOL3 was assessed using the primer-probe sets Ag474 and Ag770, 
described in Tables 17 and 1 8. Results of the RTQ-PCR runs are shown in Tables 19 and 20. 

10 Table 17, Prube oaine: Ag474 
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Primers 


Sequences 


Length Start 

i Position 


SfvQ ID 
NO: 


Forward 


y-GGCACTGTCECTCTGCACAT-y 


20 j 493 


82 


Probe 


FAM-5*- 

CCCTGAGAAAGATCTGCCACAAAGACATCT 
G-3-TAMRA 


31 | 516 

1 


83 


Reverse 


5*-AACCTGCCCACAGAGCAATC-3' 


20 549 


84 



Table 18. Probe name: Ag770 



Primers 


Sequences 


Tm 


Lcngt 
h 


Start 

Positio 

n 


Forward 


5VACAGTGC1GTCGCTGGTACTT-3' 


60 


21 


717 


Probe 


FAM-5'- 

TTCGTACTGAAAGGCGTACCTCTCCA-3 - 
TAMRA 


67.9 


26 


746 


Reverse 


5*-CTCAAACAGCTCACGAGTGAT-3' 


58.1 J 


_21 j 


773 _ 



5 Table 19. Ag474 



PANEL 13D 




PANEL 2D 




Tissue Name 


Rel. Expr., % 
1.3Dtm3254fag474 


Tissue Name 


Rel. Expr., % 
2Dtm3255f ag47 
4 


Liver 

adenocarcinoma 


0.0 


Normal Colon GENPAK 
061003 


0.3 


Pancreas 


0.0 


83219 CC Well to Mod 
Diff(OD03866) 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


83220 CC NAT 
(OD03866) 


0.0 


Adrenal gland 


0.3 


83221 CC Gx.2 
rectosigmoid (OD03868) 


0.0 


Thyroid 


0.0 


83222 CCNAT 
(OD03868) 


0.0 


Salivary gland 


0.0 


83235 CC Mod Diff 
(ODO3920) 


0.0 


Pituitary gland 


0.0 


83236 CCNAT 
(ODO3920) 


0.0 


Brain (fetal) 


0.2 


83237 CC Gr.2 ascend 
colon (OD03921) 


0.0 


Brain (whole) 


0.0 


83238 CCNAT 
(OD03921) 


0.0 


Brain (amygdala) 


0.1 


83241 CCfrom Partial 
Hcpatcctomy (ODO4309) 


3.8 


Brain (cerebellum) 


. 0.0 


83242 Liver NAT 
(ODO4309) 


100.0 
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Brain 

(hippocampus) 


0.0 


87472 Colon mets to lung 
(OD04451-01) 


0.0 


Brain (substaniia 
nigra) 


0.0 


87473 Lung NAT 
(OD04451-02) 


0.0 


Brain (thalamus) 


0.0 


Normal Prostate Clontech 
A+ 6546-1 


0.0 


Cerebral Cortex 


0.1 


84140 Prostate Cancer 
(OD04410) 


0.0 


Spinal cord 


0.0 


84141 Prostate NAT 
(OD04410) 


0.0 


CNS ca. (glio/aslro) 
U87-MG 


0.3 


87073 Prostate Cancer 
j(OD04720-01) 


0.0 


CNS ca. 

(glio/aslro) IM18- 
MG 


0.0 


87074 Prostate NAT 
(OD04720-02) 


0.0 


CNS ca. (astro) 
SW1783 


0.0 


Normal Lung GENPAK 
061010 


0.0 


CNS ca.* (neuro; 
met ) SK-N-AS 


0.0 


83239 Lung Met to Muscle 
(ODC4286) 


0.0 


CNS ca. (astro) 
SF-539 

CNS ca, (astro) 
SNB^75 


0.0 

~0.0~ ~~ ~ 


83240 Muscle NAT 
(0DO4286) 


0.0 


84136 Lung Malignant 
Cancer (OD03126) 


0.0 


CNS ca. (glio) 
SNB-19 


0.0 


84137 Lung NAT 
(OD03126) 


0.0 


CNS ca. (glio) 
U251 


0.0 


84871 Lung Cancer 
(OD04404) 


0.0 


CNS ca. (glio) 
SF-295 

Heart (fetal) 


0.0 

._ k_ ._ 


84872 Lung NAT 
(OD04404) 


0.0 


84875 Lung Cancer 
lOD045(b) 


0.0 


Heart 


0.0 


84876 Lung NAT 
(OD04563) 


0.0 


Fetal Skeletal 


0.2 


85950 Lung Cancer 
(OD04237-01) 


0.0 


Skeletal muscle 


0.0 


85970 Lung NAT 
(QD04237-02) 


0.0 

1 


Bone marrow 


0.0 


83255 Ocular Mel Met to 
Liver (ODO4310) 


0.0 , 


Thymus 


0.0 


83256 Liver NAT 
tODO43l0) 


64.2 


Spleen 


02 


84139 Melanoma Mets lo 

Ming ^*JJJU4.12J ) 


0.0 

l 


Lymph node 


0.0 


84138 Lung NAT 
(OD04321) 


0.0 


Colorectal 


0.0 


Normal Kidney GENPAK 
061008 


0.0 


Stomach 


0.1 


83786 Kidney Ca, 
Nuclear grade 2 


0.0 
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is mall intestine 


0.0 


83 7 87 Kidney NAT 
(OD04338J 


0.0 


.Colon ca. 
SW480 


03 


83788 Kidney Ca Nuclear 
grade 1/2(OD04339) 


0.0 


Colon ca.*(SW480 
raet)SWo20 


0.0 


83789 Kidney NAT 
(ODU4339) 


0.0 


Colon ca. 
HT29 


0.0 


83790 Kidney Ca, Clear 
cefltype(OD04340) 


0.1 


Colon ca. 
HCT-116 
Colon ca. 
CaCo-2 


0.0 
0.3 


83791 Kidney NAT 
(OD04340) 

83792 Kidney Ca, Nuclear 
grade3(OD04348) 


0.0 
0.0 


83219 CC Well to 

ModDiff 

(OD038dd) 


0.3 


83793 Kidney NAT 
(UD04348) 


0.0 


Colon ca. 
HCC-2998 


0.5 


87474 Kidney Cancer 
(OD04622-01) 


0.0 


Gastric ca.* (liver 
met)Nd-N87 


0.0 


87475 Kidney NAT 

/AT\A 4 A1\ 

(OD04622-03) 


0.0 


Til f J 

Bladder 


0.0 


85973 Kidney Cancer 
(OD04450-Q1) 


1 0.0 


Trachea 


0.0 


85974 Kidney NAT 
(OD04450-03) 


0.1 


Kidney 

Kidney (tetal) 


0.0 


Kidney Cancer Clontech 
8120607 


0.0 


0.0 


Kidney NAT Clonlech 
8120608 


0.0 


Renal ca. 

7864) _ _ _ 
Renal ca. 

A /f AO 

A498 


0.0 


Kidney Cancer Clontech 
8120013 


0.0 


0.0 


Kidney NAT Clontech 
8120614 


0.0 


Renal ca. 
RXF393 


0.0 


Kidney Cancer Clontech 

A A 1 A*>A A 

9010320 


0.0 


Renal ca. 
ACHN 


0.0 


Kidney NAT Clontech 

A A f A'il 1 

9010321 


0.0 


Renal ca. 
UO-31 


0.3 


Normal Uterus GENPAK 

f\£ 1 A1 O 

061018 


0.0 


Renal ca. 
TK-10 


0.1 


Ijterus Cancer GENPAK 
064011 


0.0 


Liver 


100.0 


Normal Thyroid Clontech 
A+ 6570-1 


ft A 

0.0 


Liver (fetal) 


1 o 
1.6 


Inyroiu Cancer uJiNrAK. 
064010 


tl A 

u.u 


Liver ca, 

(hepaioblasl) 

IIepG2 


0.0 


Tliyroid Cancer 
INVITROGENA302152 


0.0 


L «ng... . 


0.0 


Thyroid NAT 
JNVITROGENA302153 


0.0 
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Lune ftetaft 


0.0 


Normal Breast GENPAK 
061019 


0.0 


Lung ca. (small 
cell) LX-1 


0.0 


84877 Breast Cancer 
(OD04556) 


0.0 


Lung ca. (small 
cell) NCI-H69 


0.0 


85975 Breast Cancer 
(OD04590-01) 


0.0 


Lung ca. (s.cell 
vat.) SHP-77 


0.0 


85976 Breast Cancer Mets 
(OD04590-03) 


0.0 


Lung ca, (large 
pe!l)NCI-H460 


0.0 


87070 Breast Cancer 
Metastasis (OD04655-05) 


0.0 


lung ca, (non-snx 
Icell) A549 


0.1 


GENPAK Breast Cancer 
064006 


0.2 


Lung ca. (nan- 
's celHNCI-H23 


0.2 Breast Cancer Res. Gen. 
11024 


0.2 


Lung ca (non-s.cell) 
IIOP-62 


0.0 Breast Cancer Clontech 
9100266 


0.0 


Lung ca. (non-sicl) 
NCI-H^22 

A ™ \SA A Lv few 

,Lung ca. (squam.) 
SAY 900 


0.0 Breast NAT Clonf cch 
9100265 


0.0 


0.0 Breast Cancer 

INV1TROGEN A209073 


0.0 


[Lung ca. (squam.) 
INCI-H596 


0.0 Breast NAT 

INVITROGEN A2090734 


0.0 


MaiTirviaTV eland 


0.0 formal Liver GENPAK 
061009 


38.4 


Breast ca.* (pL 
effusion) MCF-7 


0.0 


Liver Cancer GENPAK 
064003 


49.0 


01. Gaol UL ^JjLCLJ 

MDA-MB-231 


0.0 


Liver Cancer Research 
Genetics RNA 1025 


6.1 & 


Breast ca.* (pi 
pffiisirm^ T47T} 


0.0 


Liver Cancer Research 
Genetics RNA 1026 


12.9 


Breast ca. 


0.0 


Paired Liver Cancer Tissue 
Research Genetics RNA 
6004-T 


74.2 


Breast ca. 


0.1 


Paired Liver Tissue 
Research Genetics RNA 
6004-N 


8.5 


Ovary 


0.0 


Paired Liver Cancer Tissue 
Research Genetics RNA 
6005-T 


5.0 


Ovarian ca. 
OVCAR-3 


0.5 


Paired Liver Tissue 
Research Genetics RNA 
6005-N 


IK 6 


Ovarian ca. 
OVCAR-4 


0.0 


Normal Bladder GENPAK 
061001 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


Bladder Cancer Research 
Genetics RNA 1023 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


Bladder Cancer 
pjVTIROGEN A302173 


0.0 


Ovarian ca. 


0.0 


87071 Bladder Cancer 


0.0 
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IGROV-l 




(OD0471S-01) 




Ovarian ra * 

(ascites) SK-OV-3 


0.0 

— ouT 


87072 Bladder Normal 
Adjacent (OD04718-03) 


0.0 


Uterus 


Normal Ovarv Ren Cren 


00 

vr.v 


Placenta 


0.0 


Ovarian Cancer GENPAK 
064008 


0.0 


Prostate 


0.1 


87492 Ovary Cancer 
(OD04768-07) 


0.1 


Prostate ca.* (bone 
mefiPC-3 


0.0 


87493 Ovary NAT 
(OD04768-08) 


0.0 


Testis 

X wP U9 


0.3 


Normal Stomach GENPAK 
061017 


0.0 


Melanoma 
Ks68S(A).T 


0.0 kjastric Cancer Clontech 
9060358 


0.0 


Melanoma* (met) 


0.0 NAT Stomach Clontech 
9060359 


0.0 


Melanoma 
tJACC-62 


0.0 Gastric Cancer Clontech 
P060395 


0.4 


Melanoma 
M14 

Melanoma 
LOXIMVI 


0.0 
0.0 ' 


NAT Stomach Clontech 
9060394 


. 0.0 


Gastric Cancer Clontech 
9060397 


0.3 


Melanoma* (met) 
SK-MEL-5 


0.0 


NAT Stomach Clontech 
9060396 


0.0 


1 

|Adipose 


0.0 


Gastric Cancer GENPAK 
064005 


0.0 

1 



Table 20. Ag770 



PANEL 1 




PANEL 4D 






Tissue Name 
1 


RcL Expr., % 
1.3dx4tm5495 
fag770J>2 


Tissue Name 


ReL Expr., % 
4dtmlS43f ag7 
70 


RcL Expr., 
% 

4Dtml910f 
ag770 


Liver 

adenocarcinoma 


O.O 


93768_Secondary 
Thl_anti-CD28/aoti-CD3 


0.0 


0.0 


iPancreas 

1 


0.0 


93769_Secondaiy 
Th2_anti-CD28/anti-CD3 


6.4 


0.0 
~ ~0.6 


Pancreatic ca. 
CAPAN2 


0.0 


93770_Secondary 
Trl_anti-CD28/anti-CD3 


3.1 


Adrenal gland 


0.0 


93573_Secondary 

Thl resting day 4-6 in IL- 

2 


0.0 


0.0 


Thyroid 


0.0 


93572_Secondaiy 

Th2 resting day 4-6 in IL- 

2 


0.0 


0.0 


Salivary gland 


0.0 


93571_Secondary 
Trl_resting day 4-6 in IL-2 


0.0 


0.0 
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Pituitary gland 


0.0 


93568 jDrimary Thljuiti- 
CD28/anli-CD3 


3.7 


0.0 


Brain (fetal) 


0.0 


93569 primary Th2jmti- 
CD28/anti-CD3 


0.0 


0.0 


Brain (whole) 


0.0 


93570_primaiyTTl anti- 
CD28/anti-CD3 


0.0 


0.0 


Brain 

(amygdala) 


0.0 


93565_primary 
Thljresting dy 4-6 in 1L-2 


0.0 


4.8 


Brain 

(cerebellum) 


0.2 


93566 Diiniarv 

Tb2 jesting dy 4-6 in IL-2 


0.0 


22 


Brain 

(hippocampus) 


0.1 


93567j>rimary Trl j-esting 
Qy in ll-z 


0.0 


7.0 


Brain 

(substantia 

nigra) 


0.0 


93351_CD45RACD4 

lymphocyte_anti- 

CD28/anti-CT>3 


3.1 


0.0 


Brain 

(thalamus) 


0.0 


93352_CD45RO CD4 

lymphocyte_anti- 

CD28/anti-CD3 


0.0 


0.0 


Cerebral Cortex 


0.0 


93251_CD8 

Lymphocytes_anti- 

CD28/anti-CD3 


1.4 


0.0 


Spinal cord 

1 


0.0 


93353_chromc CDS 
Lymphocytes 2i}_resting 
day 4-6 in IL-2 


0.0 


0.0 


CNS ca. 

(glio/astro) 

XJ87-MG 


0.0 


93574_chronic CDS 

Lymphocytes 

2ry activated CD3/CD28 


0.0 


0.0 


,CNS ca. 
XgUo/astro) . U- 
118-MG 


0.0 


93354_CD4jionc 


0.0 


0.0 


fcNS ca, (astro) 
oVvl Zoo 


0.0 


93252_Secondary 

TliI iTUO /T^l rrr\nc 

I nl/ 1 HZ/ It I anli-\^l^yj 

cnu 


0.0 


41.2 


CNS ca * 
i(neuro; met ) 
SK-N-AS 


0 0 


93103 IAK cells restine 


00 


00 


CNS ca. (astro) 
SF-539 


0.0 


93788JLAKcelkJL-2 


0.0 


0.0 


CNS ca. (astro) 
SNB-75 


0.0 


93787 LAKcellsJL- 
2+IL-12 


0.0 


0.0 


CNS ca. (glio) 
ISNB-19 


0.0 


93789_LAK cells JL- 
2+IFN gamma [ 


0.0 


0.0 


CNS ca. (glio) 
U251 


0.0 


93790 LAK cells IT^2+ 
IL-18 


0.0 


0.0 


CNS ca. (glio) 
SF-295 


0.0 


93104_LAK 

cells PMA/ionomycin and 
IL-18 


0.0 


5.9 
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jHeart (fetal) 


0.0 


93578_NK Cells IL- 
2_resting 


0.0 


0.0 


Heart 


0.3 


93 109 JVfixed Lymphocyte 
ReactionJTwo Way MLR 


6.0 


0.0 


Fetal Skeletal 


0.0 


93 1 10 JVfixed Lymphocyte 
ReactionJTwo Way MLR 


0.0 


0.0 


Skeletal muscle 


0.0 


931 1 l_Mixed Lymphocyte 
ReactionJTwo Way MLR 


0.0 


1.3 


Bone marrow 

i 


0.0 


93 1 12 JViononoclear Cells 
(PBMCs)_restmg 


0.0 


18.4 


Thymus 

i 


0.0 


93113 Mononuclear Cells 
(PBMCs)_PWM 


0.0 


0.0 


Spleen 


0.0 


931 ^Mononuclear Cells 
(PBMCsLPHA-L 


O.O 


6.6 


[Lymph node 

i 


0.0 


03249 JRamos (B 
cell)_none 


5.7 


0.0 


/liilnrecliil 


0.1 


932*50 Ramos (R 
cell)_ionomycin 


00 


00 


Stomach 


0.0 


^3349_B 

lyiiipiiocy te s_r w M 


0.0 


0.0 


Small intestine 


0.0 


93350 JB 

lymphoytes_CD40L and 
IL4 


, 0.0 


0.0 

1 


Colon ca. 


0.0 


9266SJBOL-1 

(nOSHlopmi J^uDCAMr 

differentiated 


0.0 


3.2 

! 


Colon ca * 

(SW480 

met)SW620 


0.2 


93248 EOL-l 

(Eosinophi^dbcAMP/PM 

Aiononiycin 


0.0 


o.o i 

i 


Colon ca. 
HT29 


0.0 


93356_Dendrittc 
CeDsjsone 


0.0 


0.0 ! 


Colon ca 
HCT-116 


0.0 


93355 Dendritic 
Cclls_LPS100ng/ml 


0.0 


0.0 


Colon ca 
CaCo-2 


0.0 


93775 Dendritic 
Cells_anti-CD40 


0.0 


0.0 


83219 CC Well 
10 jYLOU jjlu 

(OD03866) 


0.0 


93774_Mouocytesjresting 


0.0 


0.0 


Colon ca. 
HCC-2998 


1.3 


93776_Monocytes_LPS 50 
jng/nd 


0.0 


0.0 


Gastric ca.* 
(liver met) NCI- 
NS7 


0.2 


93581 JMacrophages_refctin 
S 


1.1 


0.0 


Bladder 


0.0 


93582 JVlacrophagesJLPS 
lOOng/ml 


0.0 


5S 


Trachea 


0.0 


?3098_HUVEC 
(Endothe1ial)_none 


0.0 


0.0 
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Kidney 


0.0 


93099 HUVEC 
(Endothelial)jstarved 


0.0 


0.0 


Kidney (f&toal) 


0.0 


93100 HUVEC 
(EndotheIiaI)JOL-lb 


0.0 


0.0 


Renal ca. 


0.0 


&779.HUVEC 


0.0 


0.0 


Renal ca. 


0.5 


93102JTOVFC 

^EllUUUlGllUlJ^J rir alJJiiti ~ 

IFN ^amma 


0.0 


0.0 


Renal ca. 

JxAJF J*? 

1 


0.0 


93101JIUVEC 
^unaoLneiiai^ iivr aipnaT 
IL4 


0.0 


o.o 


[Renal ca. 
•ACHN 


0.0 


9378I_HUVEC 
CBndotMial)JL-ll 


0.0 


0.0 


Renal ca. 
U031 


0.0 


93583_Lung 

Microvascular Endothelial 
Cells none 


0.0 


0.0 


.Renal ca. 

Itk-io 


0.0 


93584_Lung 

Microvascular Endothelial 
CeUsJiNFa(4ng/nil)and 

JLL.1D ^1 ngiUllj 


0.0 


0.0 


Liver 


100.0 


92662_Microvascular 
Dermal endotheliuinjioae 


0.0 


0.0 


Liver (fetal) 


0.3 


92663_Microsvasular 
Dermal endotheUumJTNFa 
\ft ng/nu/ ana ulid ^1 
ng/ml) 


0.0 


0.0 


Liver ca. 
(hepato b last) 
HcpG2 


0.0 


93773_Bronchial 

e pimei lurri 1 in 1 * a y+ ngr eq j j 

andTLlb(l ng/ml)** 


0.0 


2.2 


r imp 


0.0 


93347 Small Airwnv 
Epfthelhimjaone 


0.0 


0.0 


Lung (fetal) 


0.0 


93348 JSmall Airway 
cpiineiiurn^ 1 rsir a \*\ 
ng/ml) and ILlb (1 ng/ml) 


0.0 


3.7 


1 hup ca fcmall 

cell) LX-1 


0.0 


92668 Coronerv Arterv 
SMC_resting 


0.0 


0.0 


Lun& ca. (small 
cell) NCI-H69 


0.0 


92669 Coronerv Artery 
SMC TNFa(4ng/ml)and 
ILlb(lngtal) 


0.0 


0.0 


Lutig ca. (sxcll 
vai.) SHP-77 


0.0 


93 107_astrccytes_resting 


0.0 


3.4 


Lung ca. (large 
cell)NCI-H460 


0.0 


93108_astrocytes_TNFa (4 
ng'ml) and ILlb (1 ng/inl) 


0.0 


0.0 


Luag ca. (non- 
sin. celI)A549 


0.0 


92666JCU-812 
(Basopliil)_resting 


0.0 


1.6 


Lung ca. (non- 
s.cell)NCI-H23 


0.0 


92667JOJ-812 
(Basophil)_PMA/ionoycin 


0.0 


0.0 
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Lang ca (lion- 

S.CollJ HUr-OZ 


0.0 


93579„CCDU06 
(js£id uno cy ies i^jQon© 


0.0 


0.0 


Lung ca. (non- 
s.cl)NCI-H522 


0.0 


935«0_CCD1106 
(Kcratin(>cytes)_TNPa and 
IFNg** 


0.0 


0.0 


Luug ca. 

^niiam \ QTVT 

jsquam .) a w 
900 


0.5 


93791 Jiver Cirrhosis 


100.0 


100.0 


Lung ca. 
isquamj iNui- 
H596 


0.0 


93792_Lupus Kidney 


0.0 


0.0 


Mammary gland 


6.0 


93577_NCI-H292 


0.0 


0.0 


Breast ca.* ^pi- 
effusion) MCF- 
* 


0.0 


9335SJ*CI-H292_IL-4 


0.0 


0.0 


Breast ca.* 

Ipl.ei) JVLUA- 
MB-231 


! 0.0 


93360_NCWB292_IL-9 


0.0 


0.0 


RtY*acf * fn] 

XJXLytl^t, v>iJ- . ^ £_ri. 

pffusion) 
T47D 


0.0 


91359 NO-H292 TL-13 


0.0 


0.0 


Breast ca. 
BT-549 


0.0 


93357_Na-H292JPN 
gamma 


0.0 


0.0 


Breast ca. 
MDA-N 


0.0 


93777_HPAEC_- 


2.3 


3.4 


\J VOI.JT 


0.0 


9*778 HPAEC IL-l 

7>j t f KJ X XX X»_Lj\_> XX^» X 

beta/TNA alpha 


0.0 


0.0 


Ovftrinn (\a 

OVCAR-3 


0.0 


032 54 Normal Human 
Lung Fibroblast_none 


3.4 


0.0 


Ovarifiri 
OVCAR-4 


0.0 


93253 Normal FTiiman 
Lung Fibroblasi^TNFa (4 
ng/ml) and IL-lb (1 ng/ml) 


0.0 


0.0 | 

1 


Ovarian ca. 
OVCAR-5 


0.0 


93257JNfomiaI lluman 
Lung FibroblaBML-4 


0.0 


0.0 | 


Ovarian ca 

^-r T Mill WU> 

OVCAR-8 


0.0 


93256 Normal Human 
Lung Fibroblast JL-9 


0.0 


0.0 


Ovarian ca 
1GROV-1 


0.0 


9^255 Normal Human 
Lung Fibroblast_IL-13 


0.0 


0.0 


Ovarian ca.* 

(ascites^ SK- 

OV-3 


0.0 


93258_Normal Human 
Lime Fibroblast 1FN 
gamma 


0.0 


0.0 


Uterus 


0.5 


93106_Permal Fibroblasts 
CCD1070_restmg 


0.0 


0.0 


Plancenta 


0.0 


93361 Dermal Fibroblasts 
COD1070_TNF alpha 4 
ng/ml 


0.0 


0.0 

i 
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"Prostate 

i 


o.o 


93105 Dermal Fibroblasts 
CCD1070JL-1 beta 1 
ng/ml 


0.0 


4.3 


Prostate ca * 
(bone met)PC-3 


0.0 


93772 jiermal 
SbroblastJFN gamma 


0,0 


7.9 


Testis 


0.4 


93771 dermal 
fibrobbst_IL-4 


0.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


93259JBD Colitis 1** 


6.0 


13.1 


Melanoma* 
(rast) 

IIs688(B).T 


0.0 


93260 IBD Colitis 2 


2.8 


52 


Melanoma 
UACC-62 


0.3 


93261 JBDCrohns 


0.0 


1.9 


Melanoma 
M14 

Melanoma 
LOXTMV1 


0.1 


735010 Colon normal 


1.3 


32 


0.0 


TSSOWJUrngjaone 


14.9 


0.0 


Melanoma* 
(met) SK- 
MEL-5 


0.4 


6402R-l_Thyrmus_none 


23 


7.3 


Adipose 


0.0 


64030-l_Kidney_none 


0.0 


0.0 



Both probe/primer sets arc specific for the sequence of gene Acc. No. MOL3. Unigene 
data at http:y/ww.ncbi.nh^ 

and our RTQ-PCR panels 1 .3D,2D and 4D indicate that this gene is specifically 
5 expressed by the liver and upregulated in some hepatocellular carcinomas (HCCs).TIiere is 

evidence suggesting that the examination of the serum complements may be a useful tool for the 
detection of HCCs in liver cirrosis (LC) patients. (Takczafci E, Murakami S, Nishibayashi H, 
Kagawa K, Ohmori H. GanNo Rinsho 1990 Oct;36(12):21 19-22). Therefore the serum level of 
this protein can be used as a diagnostic marker to detect LC and HCC and antibodies directed 
10 against this protein can be a potential therapeutic agent against LC and HCC; In addition, this 
molecule can also serve as a specific marker for differentiating liver from other tissues. 



MOL4 

15 Expression of gene MOL4 was assessed using the primer-probe set Agl61 1 , described in 

Table 21 
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Table 21. Probe Name: Agl611 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 
Position 


SEQID 

NO: 


Forward 


5'-ATATGCTGTGCTGCATTCAGT-3' 
TET-5'- 

CTGCCTGGTCAGTGAACAATTTCCTG- 
3-TAMRA 


58.4 
68.9 


21 
"26 


40 


85 


Probe 


65 


86 


Reverse 


5'-CAAGGCCACACTAGTCGTGTAG-3* 


59.9 


22 


117 


87 



Expression of this gene in panels 1 .3D, 4D and 2 is at very low to undetectable levels (Ct 
5 values>35) in a number of tissues, 

MOL6 

Expression of geue MOL6a was assessed using the primer-probe set Agil67, described 
in Table 22. Results of the RTQ-PCR runs are shown in Tables 23 and 24. 
10 Table 22. Probe Name: Agll67 



Primers 


Sequences j Tm 


Lcngt 
h 


Start 
Position 


SEQID 

NO: 


Forward 


5'-TCTTTGCTGACTCATCTGTTCA-3' | 58.7 


22 


51 


88 


Probe 


TF.T-5'- , 67.5 
AAGAAGACCCTGCTCCCTATTTGGTG- 1 
3-TAMRA j 


26 


75 


89 


Rever&e 


S'-AGGGGTTGAAGTCAGACTTGAG-S' , 59.8 


22 


104 


90 



Table 23, Panels 1.3D and 4D 



PANEL 13D 




PANEL 4D 


i i 


Tissue Name 


Rel. Expr., 
% 

1.3dx4trn55 
86t agli67 
al 


Tissue Name 


Rel. Expr. j 
% j 
4Dtml937tj 
_agll67 


Liver 

adenocarcinoma 


1.1 


93768 Secondary Thl anti-CD28/anti- 
CD3 


1.7 


Pancreas 


0.0 


93769 Secondary Th2 anti-CD28/anti- 
CD3 


0.0 


PaaoTeaLic ca. 
CAP AN 2 


1.0 


93770 Secondary Trl anii~CD28/anu- 
CD3 


0.9 


Adrenal gland 


0.0 


93573_Secondary Thl_resting day 4-6 
inlL-2 


2.7 
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Thyroid 


0.0 


93572_Secondary Th2_restiiig day 4-6 
mIL-2 


0.0 


Salivary gland 


0.0 


9357 ^Secondary Trl_resling day 4-6 


0.0 


Pituitary gland 


2.1 


93568 jrimary Thl__anti-CD28/anti- 


0.0 


Brain (fetal) 


4.1 


93569_primary Th2_anti-CD28/anti- 


3.0 


Brain (whole) 


1.1 


93570 ^primary Trl_anti-CD28/anti- 


2.4 


Brain (amygdala) 


4.1 


93565_jprixaary Thl_rei>ting dy 4-6 in 

IL-2 


5.3 


Brain (cerebellum) 


6.7 


93566 jrimary Tb2_resting dy 4-6 in 
IL-2 


3.7 


Brain (hippocampus) 


4.3 


93567_primary Trl_resting dy 4-6 in 
IL-2 


3.7 


Brain (substantia 
nigra) 


5.4 


93351_CD45RACD4 
lyTnpnocyte_anti -CD28/anti-CD3 


1.6 


Brain (thalamus) 


4.7 


93352_CD45ROCD4 
lyinphocyte_anti-CD28/anri-CD3 


2.6 


Cerebral Cortex 


1.3 


93251_CD8 Lymphocytes_anti- 
CD2S/anti-CD3 


0.9 


Spinal cord 


1.6 


93353_chronic CDS Lymphocytes 
2ry_resting dy 4-6 in IL-2 


0.4 


CNS ca. (glio/astro) 
U87-MG 


0.0 


93574_chronic CDS Lymphocytes 
2iy_activated CD3/CD28 


0.9 


CNS ca. (glio/astro) 

y.f-1 j o-ivivj 


0.0 


93354_CD4_none 


1.0 


CNS ca. (astro) 


0.0 


93252_Secondary Thl/Th2/Trl_anti- 

PTW? PH1 1 


2.5 


CNS ca.* (neuro; met 

\ C*XT "XT A o 

) SK-N-A& 


0.0 


93103 LAK cellsrestmg 


7.0 


CNS ca. (astro) 


0.0 


93788_LAK.cells_IL-2 


0.9 


CNS ca. (astro) 

SNB-75 
will* / 


2.4 


93787_LAK cells_IL-2HL-12 


1.6 


CNS ca. (glio) 
SNB-19 


4.3 


93789JAK oells_IL-2+IFN gamma 


5.8 


CNS ca, (glio) 


1.3 


93790JLAK cells JL-2+ JL-l 8 


1.7 


CNS ca. (glio) 


0.0 


93 104JLAK cells_PMManomycm 
ana llt i o 


0.0 


Heart (fetal) 


0.0 


93578_NK Cells IL-2_rcsting 


2.2 


Heart 


0.0 


93 l09_Mixed Lymphocyte 
Reaction_Two Way MLR 


2.4 


Fetal Skeletal 


0.0 


93 1 lOJVCxed Lymphocyte 
ReactionTwo Way MLR 


1.6 
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Skeletal muscle 


1.0 


931 1 l_Mixed Lymphocyte 

Ivvawliun 1 hu it cxy lYJJ_.lv 


0.0 


Bone marrow 


0.0 


93 1 ^Mononuclear Cells 
(rBMCs)_resting 


0.0 


Thymus 


0.0 


93 1 ^Mononuclear Cells 
(PBMCs)_PWM 


6.0 


Spleen 


16.0 


93114 Mononuclear Cells 
(PBMCs)_PHA-L 


0.9 


f vrvuVh nnrlf* 


1 4 


93249 Ramos fR celh Tinne 




Colorectal 


0.0 


93250JRamos (B cell)_ionomycrn 


5.9 


Stomach 


0.0 


93349_B lymphocytes_PWM 


3.8 


Small intestine 


2.9 


93350^6 rymphoytcis_CD40L andTL-4 


2.4 


p)lon ca. 
pW4w 


0.0 


92665JEOH (Eosinophil)_dbcAMP 
differentiated 


82.9 


Colon ca*(SW480 
metjNWcZU 


1.7 


93248_EOL-l 

Qfcosmopiui )_Q0CAMri riVlAiononiycui 


100.0 


Colon ca. 


0.0 


93356_Dendritic Cellsjione 


2.5 


Colon ca, 

[HCT- 1 lo_ 


3.9 


93355_Dendritic CellsJJPS 100 ng/ml 


0.9 


Coloa ca. 
CaCo-2 


1.0 


93775 ^Dendritic Cells_anti-CD40 


2.5 


83219 CC Well to 
ModDiff(OD03866) 


0.0 


93774_Monocytes_resting 


0.0 


Colon ca. 
HCC-2998 


1.9 


93776_Monocytcs_LPS 50 ng/ml 


0.8 


Gastric ca.* (liver 
met) Na-N87 _ 
Bladder 


2.3 
4.9 


9358 1_ Macrophages jtesting 


0.8 


93582 Jtfacrophages_LPS lOOng/rnl 


0.4 


Trachea 


0.0 


93098 JiUVEC (Bndothelial)_none 


0.9 


Kidney 


1.8 


93099 JIUVEC (Endotlielial)_starved 


1.5 


Kidney (fetal) 


2.3 


93100JIUVEC (Endothelial) JL-lb 


0.0 


Renal ca. 
786-0 


0.0 


93779_HUVEC (Endothelkl)JFN 
gamma 


2.4 


Renal ca. 
A498 


0.9 


93102_HUVEC (Eadothelial)_TOF 
alpha + IFN gamma 


0.0 


Renal ca. 
RXF393 


0.0 


93101_HUVEC (EndotheIial)_TNF 
alpha + IL4 


0.0 


Renal ca. 
ACHN 


1.5 


93781_HUVEC (EodotlieIial)_IL-ll 


0.9 
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Renal ca. 
UO-3L 


0.0 


93583_Lung Microvascular BndotheliaJ 
Cells_none 


2.1 


Renal ca. 

ra-io 


4.0 


93584 Lung Microvascular Endothelial 
CellsJTNFa (4 ng/ml) and ILlb (1 
ng'inl) 


1.3 


Liver 


0.0 


92662_MicTovascular Dermal 
endothelium none 


J. 2 


Liver (fetal) 


0.0 


92663_Microsvasular Dermal 
endothehum TNFa (4 ng/ml) and ILlb 
(In^ml) 


1.5 


Liver ca. 

(hepatoblast) HepG2 


0.5 


93773_Bronchial epilhelium_TNFa (4 
ng/inO and ILlb (1 ng/ml) ** 


44.1 


Lung 


0.0 


93347JSmall Airway Epitheliumnone 


5.6 


Lung (fetal) 


2.2 


93348_Snmll Airway 

Epithelium TNFa (4 ng/ml) and ILlb 

(1 ng/ml) 


25.7 


Lung ca. (small cell) 
LX-1 


5.2 


92668_Coronery Artery SMC_resting 


0.7 


NCI-H69 


2.8 


^2669 Coroiierv Arterv SMC TNFa (4 
ng/ml) and ILlb (1 ng/ml) 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.8 


93 1 07_asu*ocytes_resting 


3.5 


Lunfz ca. flarfre 
cell)NCI-H460 


2.0 


93 1 08 astrocytes TNFa C4 no/mft and 
ILlb (T ng/ml) 


1.4 


Lung ca. (non-sm. 
cell)A549 


1.8 


92666_KU-812 (Basophil)_resting 


20.4 


Lung ca. (non-s.cell) 
NCI-H23 


4.7 


92667_KU-812 
(Baaophil)_PMA/ianoycin 


34.2 


Lung ca (non-s.cell) 
HOP-62 


1.3 


93579_CCDI 106 (Kcratiiiocytes)_none 


0.7 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


93580_CCD1106 

(KeL'aticocytes)_TNTa and IFNg ** 


19.1 


Lung ca. (squam.) 
SW90G 


0.0 


93791_Liver Cinfeosis 


3.2 


Lung ca. (squam.) 
NQ-H596 


12 


93792_Lapus Kidney 


52 


Mammary gland 


0.8 


93577 NCI-H292 


6.3 


Breast ca.* (pi. 
effusion) MCF-7 


3.8 


93358_NCl-H292_IL-4 


7.4 


Breast ca.* (pl.ef) 
MDA-MB-23L 


0.0 


93360_Na-H292_IL-9 


6.5 


Breast ca.* (pi. 
effusion) T47D 


0.0 


93359_NCI-H292_IL-13 


8.8 


Breast ca. 
BT-549 


0.0 


933S7_NCI-H292_IFN gamma 


4.2 
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iBreast ca. 
|MDA-N 


U.O 


93777JHPAbC_- 


0.9 


Ovary 


0.0 


93778_HPAECJL-1 beta/TNA alpha 


1.4 


Ovarian ca. 
OVCAR-3 


0.0 

- 1 


93254JMormal Human Lung 
fibroblast none 


0.0 


rr . 

Ovanan ca. 
OVCAJM 


1.6 


93253_Normal Human Lung 
FibroblastJTNFa (4 ng'ml) and IL-lb 
(1 ngfail) 


0.0 


Ovarian ca, 
OVCAR-5 


1.9 


93257_Normal Human Lung 
Fibroblast. IL-4 


1.5 


Ovarian ca. 
DVCAR-8 


2.8 


9325 6_Nornial Human Lung 
Fibroblast IL-9 


0.0 


Ovarian ca. 
IGROV-1 


2.7 


93255_Normal Human Lung 
Fibroblast IL-13 


J. 9 


Ovarian ca.* (ascites) 
SK-OV-3 


5.6 


93258_Noimal Human Lung 
Fibroblast_IFN gamma 


22 


Uterus 


0.6 


93106JDermal Fibroblasts 
CCD1070_resting 


0.8 


Plancenta 


1.0 


93361J>rmal Fibroblast 
CCD1070 TNF alpha 4 ng/ml 


1.6 1 


Prostate 


0.0 


93105 Dermal Fibroblasts 
CCD1070JL-1 beta 1 eg/ml 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


93772_deimal fibroblastJFN gamma 


2.0 


Testis 


10O.O 


93771_dermalfibroblastJL-4 


0.9 


Melanoma 
Hs688(A).T 


0.0 


93259JBD Colitis 1** 


21.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


9326GJBD Colitis 2 


1.3 


Melanoma 
UACC-62 


0.0 


93261 JBD Crohns 


0.7 


Melanoma 
M14 


0.9 


73501 0_Lolon_nomial 


0.7 


Melanoma 
LOXIMVI 


0.0 


73501 9_Lung_none 


12 


Melanoma* (met) 
SK-MEL-5 


0.0 


64028-l_Thymus_none 


41.8 


Adipose 


1.9 


64030-l_Kidney_none 


6.0 



Table 24. Panels 2D and 3D 



PANEL 2D 




PANEL 31> 




Tissue Name 


Rel. Expr., 
% 


Tissue Name 


Rel. Expr., 

% 
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2Dtm2324t 
agll67 




3dtm5309t 
J2gl)67 


NunnaJ Colon 
GENPAK 061003 


z,y 


949u5_Daoy_Meauuoblasioma/Cerebell 
urn sscDNA 


4.0 


83219 CC Well to 
Mod Dm (0DO3 866) 


2.5 


94906_TE67LMedullotlastora/Cercbcl 
lum sscDNA 


0.0 


83220 CC NAT 
(OD03866) 


18.9 


94907_D283 

Med Medulloblastoma/Cerebellum ssc 
DNA 


8.4 


83221 CC Qr2 

rectosigmoid 

(OD03868) 


2.2 


94908_PFSK-l_Pnmitive 
Neuroectodennal'Cterebellum_sscDNA 


7.0 


S3222 CCNAT 
(OD03S68) 


0.0 


94909 JXF-498_CNSj5scDNA 


0.0 


83235 CC Mod Dm 
(ODO3920) 


6.3 


94910_SNB-7S_CNS/glioina_sscDNA 


10.2 


83236 CC NAT 
(ODO3920) 


12.6 


9491 i_SF- 

268_CNS/gIioblastoma sscDNA 


0.0 


83237 CCGr.2 
ascend colon 
(OD03921) 


0.0 


94912_T9SG_Glioblastoma_sscDNA 


0.0 


83238 CCNAT 
(OD03921) 


7,7 


96776_SK-N-SHJNeuroblastoma 
(metastasis)_S3cDNA 


0.0 


83241 CC from 
Partial Hepatectoniy 
(ODO4309) 


2.9 


949I3JSF- 

295_CNS/gHoblastoma_sscDNA 


0.0 


83242 Liver NAT 
(ODO4309) 


0.0 


949 14 J>rebellumjscDNA 


66.0 


87472 Colon mets to 
hing(OD04451-01) 


5,0 


96777_Cterebellum_sscDNA 


4.4 


87473 Lung NAT 
(OD04451-02) 


9.9 


9491 6_NCr-H292_Mucoepidermoid 
lung carcinoina_sscDNA 


7.3 


Normal Prostate 
Clontech A+ 6546- 
1 


17.4 


949L7_DMS-114J*mali cell lung 
canccrjacDNA 


0.0 


84140 Prostate 
Cancer (OD04410) 


2.7 


949l8J3M3-79_Suiall cell lung 
cancer/neuroendocrine sscDNA 


100.0 


ft A 4 A < T\ ^ a "V -r 1 nr* 

84141 Prostate NAT 
(OD04410) 


26.2 


949 l9_NCI-Hl46_Small cell lung 
cancer/neuroendocrine sscDNA 


16.4 


87073 Prostate 
Cancer (OD04720- 

01 ) 


20.6 


94920_NCT-H526_SinaIl cell lung 
cancer/ncurocndocrinc_sscDNA 


2.1 


87074 Prostate NAT 
(OD04720-02) 


13.9 


9492LNCI-N4l7_Sinall cell lung 
canccr/ncurocndocrinc sscDNA 


0.0 


Normal Lung 
GENPAK 061010 


17.7 


94923 JNCI-H82_Small cell lung 
cancer/neuroendocrine sscDNA 


3.5 


83239 Lung Met to 
Muscle (0DO4286) 


5.0 


94924JsICI-Hl57_Squamous cell lung 
cancer (metastasis) sscDNA 


0.0 


83240 MusgIb NAT 


2.6 


9492SJSICI-H1 1 55J ATge cell lung 


0.0 
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(OD04286) 




cancer/nenroendocrine sscDNA 




84136 Lung 
Malignant Cancer 


4.9 


9492o_NCI-H1299_Large ceil lung 
cancer/ncurocndocrinc_sscDNA 


11.3 


842 37 Lung NAT 


0.0 


94927_NCI-H727_Lung 
carcinoid sscDNA 


~ 3.0 _ 


84871 Lung Cancer 
(001)4404) 


5.5 


94928_NCI-UMC-1 l_Lung 
carcinoid sscDNA 


10.1 


84872 Lung NAT 

/f~XTXf\AAf\A\ 

(UJJ04404) 


2.4 


94929_LX-l_Small cell lung 
cancer sscDNA 


0.0 


84S75 Lung Cancer 
(OD04505) 


2.5 


94930_Colo-205_Colon 
cancer sscDNA 


0.0 


84876 Lung NAT 
(OD04565) 


777 


9493 1 JKM12_Colon cancer_sscDNA 


4.9 


85950 Lung Cancer 
(OD04237-01) 


18.4 


94932_KM20L2_Colon 
cancer sscDNA 


0.0 


85970 Lung NAT 
(OD04237-02) 


7.8 


9493 3 JNCI-H7 16_Colon 
cancer sscDNA 


0.0 


83255 Ocular Mel 
Met to Liver 
(ODO4310) 


0.0 


94935__SW-48_Colon 
adenocaicinoma_sscDNA 


0.0 


83256 Liver NAT 
(ODO4310) 


0.0 


94936_SW11 16_Colon 
adenocarcinoma sscDNA 


1.6 


84139 Melanoma 
Mets to Lung 
(OD04321) 


3.0 


9493 7 J.S 174T_Coton 
adenocarcinonia_sscDNA 


0.0 


84138 Lung NAT 
(OD04321) 


2.6 


9493SjSW-948_Colon 
adenocarcinoma sscDNA 


4.4 


Normal Kidney 
GENPAK 061008 


100.0 


9493 9_SW-480_CoIon 
adenocarcinoma sscDNA 


0.0 


83786 Kidney Ca, 
Nuclear grade 2 
(OD04338) 


" ~56.6 


94940_NCI-SNU-5_Gastric 
carcinoma_sscDNA 


0.0 


83787 Kidney NAT 
(OD04338) 


23.2 


94941 KATO TTJ_Gasinc 
carcinoma sscDNA 


2.6 


83788 Kidney Ca 
Nuclear grade 1/2 
(OD04339) 


0.0 


94943JNCI-SNU-16_Gastric 
care jnoma_sscDN A 


3.6 


83789 Kidney NAT 
(OD04339) 


46.0 


94944_NCI-SNU-l_Gastric 
carcinoma sscDNA 


0.0 


83790 Kidney Ca, 
Clear cell type 
(OD04340) 


12.9 


94946_RF-l_Gastric 
adenocarcinoitta_sscDNA 


0.0 


S3791 Kidney NAT 
(OD04340) 


24.5 


94947 JtF-48_Gastric 
adenocarcinoma sscDNA 


~ o.o ~ 


83792 Kidney Ca, 
Nuclear grade 3 
(OD04348) 


4.5 


96778 J4KN-45_Gastric 
carcmoma_sscDNA 


0.0 
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83793 Kidney NAT 
j(OD04348) 


44.4 


94949_NCI-N87J3astric 
carcinoma sscDKA 


0.0 


87474 Kidney Cancer 
(OD04622-01) 


2.8 


9495 1 JO VCAR-5_Ovarian 
carcinoma sscDNA 


0.0 


187475 Kidney NAT 
(OD04622-03) 


0.0 


94952_KL95-2_Uterine 
carcinoma sscDNA 


0.0 


85973 Kidney Cancer 
(OD044 50-01) 


4.4 


94953_HelaS3_Cervical 
adenocarcinoma sscDNA 


0.0 


85974 Kidney NAT 
(OD04450-03) 


24.8 j94954_Ca Ski_Ccrvical epidermoid 
icaicinoraa {metastasis)_58cDNA 


5.8 


Kidney Cancer 
Clontech 8120607 


0.0 j94955_ES-2_Ovarian clear cell 
(carcinoma sscDNA 


0.0 


Kidney NAT 
Clontech 8120608 


0.0 


94957 Jlamos/6hstimJ r ; Stimulated 
with PMA/ionomyein 6h sscDNA 


0.0 


Kidney Cancer 
Clontech 8120613 


0.0 


94958_Ramos/14h slimj'; Stimulated 
with PMA/ionomycin I4h sscDNA 


0.0 


Kidney NAT 
Clontech 8120614 


0.0 


94962 JMEG-0 l_Chronic myelogenous 
leukemia (megokuryoblasO_sscDNA 


28.3 


Kidney Cancer 
Clontech 9010320 


2.6 


94963 JtejiJBuriritfs 
lymphoma sscDNA 


0.0 


Kidney NAT 
Clontech 9010321 


8.4 


94964_Dandi_Burfcitt , s 
lymphoma sscDNA 


2.9 


Normal Uterus 
GENPAK 061018 


0.0 


94965_U266J3-cell 
plasmacytoma/myeloma sscDNA 


6.4 


Uterus Cancer 
GENPAK 0640 11 


16.2 


94968 j:A46_Burkitfs 
Iyniphoma_sscDNA 


0.0 


Normal Thyroid 
Clontech A+ 6570- 
1 

Thyroid Cancer 
GENPAK 064010 


1.8 


94970_RL_non-Hodgkin's B-cell 
lymphoma sscDNA 


0.0 


~~ 27.2 "~ 


94972 JMlj>re-B-cell 
Iymphoma/leukeraia sscDNA 


6.7 


Thyroid Cancer 
INVITROGEN 
A302152 


5.2 


94973_JurkatJT cell leukemia_sscDNA 


4.2 


Thyroid NAT 

INVITROGEN 

A302153 


19.9 


94974_TF- lErythroleukemiasscDNA 


2.4 


Normal Breast 
GENPAK 061019 


47.3 


94975 JIUT 78_T-cell 
lymphoma_sscDNA 


2.8 


84877 Breast Cancer 
(OD04566) 


7.0 


94977 JJ937Jfistiocytic 
lymphoma sscDNA 


6.6 


85975 Breast Cancer 
(OD04590-01) 


15,7 


94980 KU-8 1 2 ^Myelogenous 
leukemia sscDNA 


18 2 


85976 Breast Cancer 
Mcts (OD04590-03) 


17.7 


94981 J769-P_Clear cell renal 
carcinoma sscDNA 


2.6 


87070 Breast Cancer 

Metastasis 

(OD04655-05) 


76.8 


94983_Caki-2_Clear cell renal 
carcinoma_sscDNA 


0.0 


GENPAK BTeast 


42.6 


049S4_SW 839_Clear cell renal 


1.9 
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Cancer 064006 


'|Caicinoma_sscDNA 




Breast Cancer Res, 
Gen, 1024 


47.6 p4986j340l_WiW tumor_sscDNA 


0.0 


Breast Cancer 
Ctontech 9100266 
Breast NAT Clontech 
9100265 




y*tyO f XXb /OO 1 JL tUlUCalK/ Uai OlllOIJlii 

(LN merastasis)jsscDNA 

adenocarcinoma (liver 
metastasis sscDNA 


0.0 
0.0 


Breast Cancer 
USTVTTROGEN 

A209073 

Breast NAT 

INVUROGEN 

A2090734 


10.7 


94989_SU86.86_Pancreatic carcinoma 
(liver metastasis)_sscDNA 


3.2 


US 


94990_BxPC-3_Pancreatic 
adenocarcinomasscDNA 


0.0 


Normal Liver 
GENPAK 061009 


13.1 


94991_OTAC_Pancreatic 
adenocarcinoma sscDNA 


3.6 


Liver Cancer 
GENPAK 064003 


4.8 


94992_MIA PaCa-2_Pancreatic 
carcinoma sscjuiwy 


0.0 


Liver Cancer 
Research Genetics 
RNA 1025 


2.5 


94993_CFPAC-l_Pancreatic ductal 
adenocarcinoma_sscDNA 




Liver Cancer 
Research Genetics 
RNA 1026 


0.0 


94994 J>ANC-l_Pancreatic epifhelioid 
ductal carcinorna_sscDNA 


0.0 


Paired Liver Cancer 
Tissue Research 
Genetics RNA 6004- 
T 


5.4 


94996_T24_Bladaer carcinma 
(transitional cell)_sscDNA 


3.1 


Paired Liver Tissue 
Research Genetics 
RNA6004-N 


2.5 


94997_5637_Bladder 
carcjnonja.__ssci/r>j a 


0.0 


Paired Liver Cancer 
Tissue Research 
Genetics RNA 6005- 
T 


0.0 


94998 JIT-U97_Bladder 
Ccirc in o raa_ ssc u im a 


20.2 


Paired Liver Tissue 
Research Genetics 
RNA6005-N 


u.u 


7 *t y yzt^Jjrv 1- U -5_r5 1 aCLQ cr carClilOTBa 

(transitional cell)_sscDNA 


0.0 


Noimai Bladder 

genpako610oi 


3.1 


Q^flDO A204 RhahrifymvnQflTrfwnn ssc 

DNA~ 


3.3 


Bladder Cancer 
Research Genetics 

"DVT A 1M1 

RNA 1023 


0.0 


95001_HT- 

1080_Fibrosarcoma_sscDNA 


52 


Bladder Cancer 
INVITROGEN 
A302173 


16.7 


95002_MG-63_Osteosavoma 
(bone)_sscDNA 


0.0 


87071 Bladder 
Cancer (OD04718- 
01) 


0.0 


95003_SK-LMS-l_Leiomyosarcoma 
(vulva)_sscDNA 


7.9 
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87072 Bladder 
Normal Adjacent 
(OD04718-03) 


2.9 


95004_SJRH30JHiiabdomyosarcoma 
(met to bone marrow)_sscDNA 


0.0 


Normal Ovary Res. 
Gen. 


0.0 


95005_A431JEpideHiioid 
carcinomasscDNA 


0.0 


Ovarian Cancer 
GENPAK 064008 


12.6 


95007_mf266-4_Melanoma_sscDNA 


0.0 


87492 Ovary Cancer 
(OD04768-07) 


2.9 


95010 JXI 145_Prostate carcinoma 
(brain metastasis) sacDNA 


0.0 


87493 Ovary NAT 
(OD04768-08) 


0.0 


95012_MDA-MB^468_Breast 
adenocarcinomasscDNA 


0.0 


Normal Stomach 
GENPAK 061017 


5.9 


95013 JSCC-4_Squaraous cell 
carcinoma of tongue sscDNA 


0.0 


Gastric Cancer 
Clontech 9060358 


0.0 


95014_SCC-9_Squamous cell 
carcinoma of tongue sscDNA 


0.0 


NAT Stomach 
Clontech 9060359 


0.0 


95015_SCC-15_Squamous ceil 
carcinoma of tongue sscDNA 


0.0 


Gastric Cancer 
Clontech 9060395 


2.9 


9501 7_CAT , 27_SquamoiLS cell 
carcinoma of tonguesscDNA 


0.0 


NAT Stomach 
Qontech 9060394 


0.0 






Gastric Cancer 
Qontech 9060397 


0.0 






NAT Stomach 
Clontech 9060396 


0.0 






Gastric Cancer 
GENPAK 064005 


10.5 







Expression of gene MOL6 in panel 1.3D is detected in the testes and at very low levels in 
the spleen, but not in any other tissues. Expression in panel 2D indicates higher expression in 
5 normal kidney and markedly lower expression in kidney cancer. This indicates a potential role 
for this gene as a protein therapeutic in cases of kidney cancer. In panel 3D a expression is seen to 
be highest in a specimen of small cell lung cancer, chronic myelogenous leukemia and bladder 
cancer. 

Expression of gene MOL6 in Panel AD is up-reguJated in keratinocytes and small airway 
10 epithelium after treatment with TNF alpha and IL-1 beta. It is also upregulated in eosinophils 
regardless of treatment and in normal thymus. 

Potential Role(s) of MOL6 in Inflammation: The expression pattern of GM_87760758_A 
shows that it is induced in keralinocytes and small airway epithelium in response to pro- 
inflammatory cytokines. Thus the protein in question may contribute to tissue destruction in the 
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airways, recruitment of leukocytes, and tiss\ie remodeling (Reichart ct aL, 1 996 J. Pathol. 1 78 
(2): 21 5^20). 

Impact of Therapeutic Targeting of MOL6: Antibodies or small molecule therapeutics to 
MOL6 may reduce or inhibit tissue damage due to inflammation in psoriasis, asthma and other 
5 mast cell-mediated diseases both in the skin and in the airways. The Tesults arc also suggestive 
of a potential role for MOL6 in the treatment for emphysema (Rice et al v 1998 Ourr Pharm Des 
4(5): 381-96). 

MOL7 

10 Expression of gene MOL7 was assessed using the primer-probe sets Agl 876, described 

in Table 25. Results of the RTQ-PCRruns are shown m Table 26. 



Table 25. Probe name: Agl876 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 
Posilio 

B 


SEQ 
ID NO: 


Forward 


5^AGCAAGATTGCTCACACAGAGT-3' 


592. 


22 


668 


91 


Probe 


TET-5'- 

CCAGTCAATACCATCATCATCCATGAGG- 
3-TAMRA 


69.1 


28 


692 


92 


Reverse 


S'-TATGTTGTTGCTCATGGAGTTG-a' 


58.7 


r'22 


730 


93 



15 

Table 26. Panel 1 3D 



Tissue Name 


ReL Expr., % 
1.3dx4tm5422t 
agl876jil 


Tissue Name 


ReL lucpr., % 
1.3dx4tm5422t 
_aal876_al 


Liver 

adenocarcinoma 


0.0 


Kidney (fetal) 


0.0 


Pancreas 


0.0 


Renal ca. 


786-0 


0.0 


Pancreatic ca. 
CAP AN 2 


M 


Renal ca. 


A498 


0.5 


Adrenal gland 


0.7 


Renal ca. 


RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. 


ACHN 


0.6 


Salivary gland 


0.4 


Renal ca. 


UO-3J 


0.0 


Pituitary gland 


0.0 


Renal ca. 


TK-I0 


0.4 


Brain (fetal) 


0.3 


Liver 


0.0 
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Brain (whole) 


4.2 


Liver (fetal) 


0.0 ! 


Brain (amygdala) 


2.5 


Liver ca. (hepatoblast) HepG2 


0.0 | 


Olaiil ^VvIvt/cilUlilj 


(I X 


[ .iing 


0 0 1 

V.v j 


rjid.ll J t m 1 ijJUi/Hii iffUjti 


0 7 


T 1ITKF (Ffz-ivkW 
l jLLIIk 1 luldl 1 


fl ft 


nigra) 


1 4 

1 .*t 


T nnn t cm n 1 1 i T 
JUUIlg \nX* I blllail LtSLTl. 


1.3 | 


rsrain ^rnaianius j 


1.0 


i-ung ca. \ small cai) inui- 
H69 




L.<5reotdi i^onex 


u.o 


i*ung ca. i s.ceu vai j ;>rLr-/ v 


A O 


Spinal cord 


0.4 


Lung ca. (large ccll)NCI-H460 


0.4 , 
1 


CNS ca. (glio/astro) 

T TOT Tk Mf\ 

U87-MG 


0.0 


Lung ca. (non-sin. cell) A549 


0.5 

1 


CNSca. (glio/astro) 

IT 1 1 CI A /T/""1 

U-llc-Mvj 


0.2 


Long ca. (non-s.cell) NCI-H23 


0.0 | 


CNS ca, Castro) 
SW1783 


0.0 


Lung ca (non-s.cell) HOP-62 


0.0 


CNS ca.* (neuro; met 
) SK-N-AS 


0.7 


Lung ca. (non-sxl) NCI-H522 


0.3 


CNS ca. (astro) 
SF-539 


0.0 


Lung ca. (squam.) SW 900 


0.8 


CNS ca. (astro) 
SNB-75 


0.0 


Lung ca. (squam.) NCI-H596 


0.0 


CNS ca.(glio) 
SNB-19 


22 


Mammary gland 


0.0 


CNS ca. (glio) 
U251 


1.3 


Breast ca.* (pi. effusion) 
MCF-7 


2.9 


CNS ca. (glio) 
SF-295 


0.4 


Breast ca,* (pl.ef) MDA-MB- 
23 1 


0.0 


Heart (fetal) 


0.0 


Breast ca * (pi. effusion) 
T47D 


0.0 


Heart 


0.0 


Breast ca. BT- 
549 


0.6 


Fetal Skeletal 


0.0 


Breast ca. MDA- 
N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


o.o 


Ovarian ca. OVCAR- 

<> 
3 


0.0 


Thymus 


1.0 


Ovarian ca. OVCAR- 

A 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.3 


Lymph node 


0.5 


Ovarian ca. OVCAR- 
8 


0.8 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 
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Stomach 


0.4 


Ovarian ca.* (ascites) SK-OV- 
3 


0.6 


Small intestine 


0.7 


Uterus 


0.0 


Colon ca. 


0.0 


Plaucenta 


5.5 


Colon ca.*(SW480 
met)i>w62U 


0.6 


Prostate 


0.0 


Colon ca. 


0.0 


Prostate ca * (bone met)PC-3 


0.0 


Colon ca. 
HCL-llo 


0.3 


Testis 


100.0 


Colon ca. 
CaCo-2 


0.0 


Melanoma Hs688(A).T 


0.0 


83219 CO Well to 
ModDifif(0DO3866) 


0.0 


Melanoma* (met) Hs6S8(B)T 


0.0 


HCC-2998 




IViCldHLHIUl \J r\\^K^ 




Gastric ca.* (liver 
met)NCI-N87 


0.0 


Melanoma Ml 4 


0.0 


Bladder 


0.0 


Melanoma LOX1MV1 


0.0 


Trachea 


32.3 


Melanoma* (met) SK-MEL-5 


0.0 


Kidney 


0.0 


AdipOic 


8.2 



Expression of MOL7 in panel 1 3D was highest in testis, followed by trachea. Expression 
in the brain is at much lower levels. This molecule may therefore have a role in male fertility. 
There was low to undetectable expression in the samples of panel 4D (CT values >35). 



MOLftb 

Expression of gene MOLSb was assessed using the primer-probe sets Ag3 183, detjeribed 
10 in Table 27. Results of the RTQ-PCR runs are shown in Table 28. 



Table 27. Probe name: Ag3183 



Primers 
Forward 


Sequences 
5'-AAGGGGACGAGTGTGGGATT-3' 


Tm 
62 


Lengt 

h 

20 


Start 

Position 

211 


SEQID 1 
NO: ! 
94 " | 


Probe 


TET-5'-TGGCACCGAAGTAGCCGTGGCG- 
3'-TAMRA 


74 


22 


301 


95 

1 


Reverse 


5'-GCGGGCAC'l'rGGTGTCGCA-3' 


64 


19 


325 


96 | 
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Table 28. Panel 4D 



Tissue Name 


ReL Expr., % 
4dx4tm4998t , 
ag31S3_a2 


Tissue Name 


Rel. Expr., % 
4dx4tm4998t a 
g3183_a2 


S% mm ^ f\ m fill 1 •* 

93768 Secondaiy TM ant- 
CD28/anti-CD3 


0.0 


93100JIUVEC 
(Endothelial) JL- lb 


5.7 


93769_Secondary Th2_an1i- 
t»28/anti-CD3 


0.0 


93779_HUVEC 
(Endothelial) JFN 
gamma 


18.4 


93770_Secandary Trl anti- 
CD28/anti-CD3 


0.0 


93102_HUVEC 
(Endothelial)_TNF 
alpha H IFN gamma 


9.6 


93573 ^Secondary Thlresting 
day4-6inDL-2 


0.0 


93101^ HUVHC 
(Endoihelial)JTNF 
alpha + IL4 


11.2 


93572_Secondary Th2_resting 
day 4-6 in IL-2 


0.0 


93781JBUVEC 
(Endothelial) JL41 


193 


93571_Secandary Trl_resting 
day4-6inlL-2 


2.5 


93583JLung 
Microvascular 
Endothelial Cells none 


21.6 


9356S_primaiy Thl anti- 
CD28/anti-CD3 


0.0 


93584_Lung 
Microvascular 
Endothelial 
CeHsjrNFa{4 ng/ml) 
andILlb(I ng/ml) 


32.1 


93569 jrimary Th2_anti- 
CD28/ariti-CD3 


0.0 


92662JVIicrovascular 
Dermal 

endothelium none 


10/7 


93570 jrimary Trljuiti- 
CD28/anti-CD3 


0.0 


92663_Microsvaaular 
Dermal 

endothelium TNFa(4 
ng/ml) and ILlb (1 
ng/rai) 


S.O 


93565jDrimary Thl_rcfitmg dy 
4-6 in IL-2 


0.0 


93773__Bxonchial 
epitheliumJTNFa (4 
ng/mlj ana 1LI b (I 
ngfaj) ** 


11.6 


93566_piimary Th2_resting dy 
4-6 in IL-2 


2.3 


93347_Small Airway 
Epithelium none 


5.6 


93567 primary TrI resting dy 
4-6 in IL-2 


0.7 


9334S Small Airway 
Epithelium^TNFa (4 
ng/ml) and ILlb (1 
ng/ml) 


4.8 


93351_CD45RACD4 
lymphocyte anti-CD28/anti- 
CD3 


29.9 


92668_Con>nery 
Artery SMC jresting 


46.8 
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lymphocvte anti-CD28/anti- 
CD3 


0 ft 

v.v 


Artery SMC_TNFa (4 
ng/ml)andILlb (1 


HJ.L 


93251_CD8 

Lymphocytesjanti-CD28/aiiti- 


0.0 


93 1 07_astn>cytes_resti 
ng 


82.4 


|93353_chronic CD8 
Lymphocytes 2ry_resting dy 4- 
!6mIT^2 

|w 111 J J^r ^ 


0.0 


93 1 08_astrocytes_TNF 
a(4ng/ml)andILlb (1 

,c fcr *'''/ 


74.9 


b3574_chronic CD8 
CD3/CD28 


0.0 


92666JCU-812 


0.0 


193354 J^D4_none 
■ 


0.4 


92667_KU-812 

fT-lsicATYhil i P\A A/ffinrw 

^AJ do vl pill X JYlXTJ IVJllUjr 

cm 


0.0 


93252 Seconda^ 
|lTilATi2/rrl_aiiti-CD95 CH11 


2.1 


93579.CCD1106 
(Keratinocytes)_noiie 


0.0 


93 1 U3_LAK cells_resting 

l 


0.0 


935S0_CCD1106 
(Keratinocytcs) TNFa 
and IFNg ** 


0.0 


93788_LAK cdls_U ,-2 


0.0 


9379 l_l a ver Cirrhosis 


5.6 


93787 JAK ceils JO.-2-HL-12 


0.0 


93792_Lupus Kidney 


9.3 


93789_LAK cells JL-2-HFN 
'gamma 


0.0 


93577_NCI-H292 


2.5 


93790_LAK ccl]sJL-24-D.-18 


0.0 


93358_NCI-H292_TL-4 


0.0 


<n 1 ffci T ATT 

cells PMA/ionomycin and IL- 

;18 




Q^^fifl MPT-M707 IT O 




93578_NK Cells IL-2_resting 

i 


0.0 


93359_NCl-H292JL- 
13 


3.8 


,93109 Mixed Lymphocyte 
Reaction Two Wav MLR 


0.0 


93357 NCI-H292 IFN 


0.0 


■931 lOJVCixed Lymphocyte 
Reaction^ Two Way MLR 


0.0 


93777_HPAEC_- 


31.9 


93 1 1 l_Mixed Lymphocyte 

-ixva^LLvii x wv/ tv ay ivjuui\. 


0.0 


93778_HPAEC_1L-1 

IsCLar JLJ/NiT. OlyLU* 


22.4 


93112_Moaoimclear Cells 


0.0 


93254 JSformal Human 

r i in Cr P'lKmlil act n rm p 


44.4 


p3113 Mononuclear Cells 
(PBMCs)_PWM 


1.8 


93253_Normal Human 
Lung Fibroblast TNFa 
(4ng/ml)andIL-lb(l 
ng/ml) 


28.3 


93114 Mononuclear Cells 
(PBMCsLPHA-L 


0.0 


93257_Nonnal Human 
Lung FibToblast_lT^4 


62.6 


93249_Ramos (B cell)_uone 


0.0 


93256_Nonnal Human 
LungFibroblasLlL-9 


100.0 
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fcu250_Ramos (B 
pell)_ionomycin 


0.0 


93255_Nbnnal Human 
Lung FibToblastJfL-13 


73.3 


p3349_B lymphocytes^PWM 

l 


0.0 


93258_Nonnal Human 
Lung Fibroblast_IFN 
gamma 


57.1 


,93350 Blymphoytes CD40L 
andIL-4 

i 


0.0 


93106_Dermal 

Fibroblasts 

CCD1070_resting 


91.5 


92665_EOL-1 

(EosinophilLdbcAMP 

differentiated 


0.0 


93361_Deimal 
Fibroblasts 

CCD1070_TOFalpha4 
ng/nil 


31.1 


'93248_EOt-l 

i(Eosmophil)_dbcAMP/PMAio 
nomycin 


1.2 


93105 JDernial 
Fibroblaj-rts 
CCD1070JL-1 beta 1 
ng/ml 


69.7 


93356 JDcndritic Cells none 


1.8 


93772_dennal 
flbrobIast_IFN gamma 


18.3 


93355 J)endritic Cells JLPS 
100 ng/ml 


0.0 


93771_dernial 
fibroblast IL-4 


22.4 


/ / j i Jen antic cells ami- 
CD40" 


i\ A 


0*>KC1 TDH |^ —.1 "4.*- 1 ** 




f /^_MonocyTes_resTing 


'y q 


yJZOU_Xx3JJ ^ Oil US L 


U.U j 


93776_Monoc>tes_LPS 50 
ng/ml 


0.0 


93261JBD Crohns 


0.8 


93581_Macrophages_restiag 


0.0 


735010_Coloa_nonna! 


8.9 


93582 Macrophages LPS 100 
ng/'ml 


0.0 


7350l9_Limg_none 


18.5 


9309SJIUVEC 
(Endothelial) none 


8.4 


64028-1 JITiymusnone 


9.4 


93099JJUVEC 
(Endothelia])_starved 


29.9 


64030-l_Zidney_none 


3.0 



Expression of gene CG50889J02 in two runs of panel LSD was not reproducible and is not considered 
further. 

Expression of gene CG50889J02 in Panel 4D: There is 30-fold increase in the expression of 
5 CG50889JD2 in activated naive T cells (CD4+ CD45RA cells) as compared to resting CD4 cells. This 
protein is expressed both in resting and activated fibroblasts* endothelium, and epitfieJiwn* 
Potential Role(s) of CG50889-02 in Inflammation 

MOL8b may be important in the initial activation of naive T cells. Activated T cells 
initiate the inflammatory process by secreting cytokines and chemokines, which in turn induce B 
10 cell antibody production leading to the extravasation of leukocytes into inflammatory sites. 
Impact of Therapeutic Targeting of MOL8b: 

Antibody or small molecule therapeutics to MOL8b may block T cell activation in 

response to tissue transplant and reduce or block rejection. These therapeutic drug* may also 

reduce or prevent inflammation in asthma/allergy, psoriasis, arthritis and diabetes in which 
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activated T cells play a pivotal role. Antibodies to MOLSb may also serve as a diagnostic or 
experimental tool to identify and differentiate naive activated T cells from more differentiated T 
cell population (memory T cells). 

MOL9a 

5 Expression of gene MOL9a was assessed using the primer-probe set Ag673, described in 

Table 29. Results of the RTQ-PCRruns are shown in Tables 30 and 31. 

Table 29. Probe name: Ag673 



Primers 


Sequences 


Tm 


T^ngth 


Start 
Position 


SfRQTD 
NO: 


Forward 


5'-TAGAG'lTGGTGGTTCCAGGATT-3' 


59.9 


22 


6 


97 t 


Probe 


FAM-5'- 

TGATGTCTCCTCTTCAGGCAATGTCT- 
3-TAMRA 


66.6 


26 


56 


98 


Reverse 


5 , -TCTGCCAGCCACAGTATAGG-3' 


58.9 


20 


83 


99 



10 

Table 30. Panels 1.3D and 2D 



(PANEL 1 3D 




PANEL 2D 


I 


Tissue Name 


Rel. Rxpr., % 
1.3dx4tm5796i 
__ag673 bl 


Tissue Name 


ReLExpr.,% ' 
2Dtm2310f ag6 
73 


Liver adenocarcinoma 


23.4 


Normal Colon GENPAK 
061003 


39.2 1 


Pancreas 


0.9 


83219 CC Well to Mod Diff 
(OD03866) 


15.0 ! 

1 


Pancreatic ca. 
CAPAN2 


8.4 


83220 CC NAT (OD03 866) 


15.1 


Adrenal gland 


0.2 


83221 CC Gr.2 rectosigmoid 
(ODG3868) 


13.1 

i 


Thyroid 


0.9 


83222 CC NAT (OD03S68) 


3.9 


Salivary gland 


0.2 


83235 CC Mod Diff 
(0DO3920) 


20.3 

i 


Pituitary gland 


0.4 


83236 CC NAT (ODO3920) 


7.6 


Brain (fetal) 


8.2 


83237 CC Gr.2 ascend colon 
(OD03921) 


36.3 


Brain (whole) 


4.0 


83238 CC NAT (OD0392I) 


4.8 


Brain (amygdala) 


1.5 


83241 CCfiom Partial 
Hepatectomy (ODO4309) 


30.6 


Brain (cerebellum) 


22 


83242 liver NAT (ODO4309) 


20.3 
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Brain (hippocampus) 


2.9 


87472 Colon mets to lung 


41.5 


Brain (substantia nigra) 


1.6 


87473 Lung NAT (OD04451- 

(Yy\ 
\jZ) 


12,1 


Brain (thalamus) 


1.6 


"Normal Prostate Clontech A+ 


25.5 


Cerebral Cortex 


5.3 


^140 Prostate Cancer 


22.5 


Spinal cord 


1.7 


84141 Prostate NAT 

(UJJU44 W) 


23.2 


CNS ca. (glio/asilio) 

T TOT \AT1. 


6.5 


87073 Prostate Cancer 
(CJ1JU4 /iU-Ul) 


10.7 


CNSca. (glio/astro) 

T T 1 1 O \ JT/""* 

U-1I5-MCj 


8.5 


87074 Prostate NAT 
(OD04720-02) 


24.5 


|CNS ca. (astro) 


21.6 


Normal Lung GENPAK. 
UdjUIU 


27.0 


CNS ca.* (neuro; met ) 


12.6 


83239 Lung Met to Muscle 
(OD04286) 


63.3 


CNS ca. (astro) 
ISFo^9 


3.9 


83240 Muscle NAT 
(OD04286) 


50.7 


CNS ca. (astro) 
isNB-75 


18.7 


84136 Lung Malignant Cancer 
IOD03I26) 


42.3 


CNS ca. (glio) 
SNB-19 


4.3 


841 37 Lung NAT (OD03 126) 


29.1 


CNS ca. (glio) 
U251 


17.7 


84871 Lung Cancer 
(OD04404) 


69.3 


CNS ca. (glio) 
SF-295 


12.6 


84872 Lung NAT (OD04404) 


• 26.8 


Heart (fetal) 


0.2 


84875 Lung Cancer 
(OD04565) 


19.1 


Heart 


0.6 


84876 Lung NAT (OD04565) 


122 


Fetal Skeletal 


0.8 


85950 Lung Cancer 
(OD04237-01) 


94.0 


Skeletal muscle 


6.5 


85970 Lung NAT (OD04237- 
02) 


20.3 




v. f 


R^7SS Ovular MpI Mf*t in 

Liver (ODO4310) 


74 7 


inyinus 


ft A 




An i 


Spleen 


0.3 


84139 Melanoma Mets to 
Lung(OD0432I) 


36.3 


Lymph node 


0.8 


84138 Lung NAT (OD04321) 


29.7 


Colorectal 


\2 


Nomial Kidney GENPAK 
061008 


62.4 


Stomach 


02 


S3786 Kidney Ca, Nuclear 
grade2(OD04338) 


66.0 


Small intestine 


OA 


83787 Kidney NAT 
(OD04338) 


23.3 
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Colon ca. 


10.2 


83788 Kidney Ca Nuclear 
grace ui (UL/iwjjt/ 


34.9 


Colon ca.*(SW480 
met;e>woZU 


59.0 


83789 Kidney NAT 


31.0 


Colon ca. 


8.4 


83790 Kidney Ca, Clear cell 

Type {jJUSJHjW) 


45.7 


Colon ca. 


14.7 


83791 Kidney NAT 


33.0 


Colon ca. 
uat^o-z 


18.6 


83792 Kidney Ca, Nuclear 
graae J tuuvw o ) 


18.3 


83219 CC Well to Mod 
Din (UUUioObj 


2.2 


83793 Kidney NAT 


42.9 j 


Colon ca. 
HLL-29ye 


16.6 


S7474 Kidney Cancer 


14.5 


Gastric ca.* (liver met) 


12.3 1 


87475 Kidney NAT 
(UJJU402Z-U3 J 


3.9 


Bladder 


5.9 


85973 Kidney Cancer 


70.2 


Trachea 


0.3 


85974 Kidney NAT 
(OD04450-03) 


31.6 


Kidney 


1.8 


Kidney Cancer Clonlech 
8120607 


11.4 


Kidney (fetal) 


7.8 


Kidney NAT Clontech 
8120608 


7.1 


Renal ca. 
786-0 


8.5 


Kidney Cancer Clonlech J 8.2 
8120613 ! 


Renal ca. 

a a no 

A498 


7.0 


Kidney NAT Clontech 8.1 
8120614 


Renal ca. 


17.6 


Kidney Cancer Cloutech 1 8.3 
9010320 


Renal ca. 


3.9 


Kidney NAT Clontech 33.4 
9010321 


Renal ca. 

UU-31 


12.8 


Normal Uterus GENPAK 11.8 
061018 


Renal ca. 
TK-20 


23.4 


L'tenis Cancer GENPAK 30.1 
064011 


Liver 


1.4 


Normal Thyroid Clontech A+ 17.3 
6570-1 


Liver (fetal) 


1.0 


Thyroid Cancer GENPAK . 39.2 
064010 


Liver ca. (hepatoblast) 


9.3 


Thyroid Cancer ' 18.7 
INVITROGEN A302152 , 


Lung 


77.4 


Thyroid NAT INVITROGEN 
A302153 


18.7 


Lung (fetal) 


1.7 


Normal Breast GENPAK 
061019 


32.3 


Lung ca. (small cell) 
LX-1 


57.5 


84877 Breast Cancer 
(OD04566) 


40.6 
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Lung ca. (small cell) 
.NCI-Hoy 


11.7 


85975 Breast Cancer 
{\JU\y±jy\j-\j i ) 


100.0 


(Lung ca. (s.cell var.) 
SHr-7/ 


100.0 


85976 Breast Cancer Mets 
tUDU4S9U-03 ) 


76.3 


Lung ca. (large 


6.3 


87070 Breast Cancer 
Metastasis ^uuu4u j j-Uj y 


73.2 


Lung ca. (non~sm. cell) 
'A549 


28.7 


GENPAK Breast Cancer 

fUZAflf\£. 

Uo4UUo 


46.0 


Lung ca. (non-s.cell) 
NCI-H23 


4.8 


Breast Cancer Res. Gen. 1024 


31.2 


Lung ca (non-s.cell) 
HOP-62 


3.1 


Breast Cancer Clontech 
9100266 


41.5 


Lung ca. (non-s.cl) 
NCI'H522 


31.7 


Breast NAT Clontech 
9100265 


13.4 


Lung ca. (squam.) 
SW 900 


19.4 


Breast Cancer INVITROGEN 
A209Q73 


39.2 


Lung ca. (squam.) 
NCI-H596 


27.4 


Breast NAT INVITROGEN 
A2090734 


39.8 


Mammary gland 


3.7 


Nonnal Liver GENPAK 
061009 


20.9 


Breast ca * (pi. 
effusion) MCF-7 


10.6 


Liver Cancer GENPAK 
064003 


15.6 


Breast ca.* (pl.ef) 
MDA-MB-231 


— 

6.6 


Liver Cancer Research 
Genetics RNA 1025 


17.2 


Breast ca * (pi. 
effusion) T47D 


17.8 


Liver Cancer Research 
Genetics RNA 1026 


9.6 


Breast ca. 
BT-549 


15.0 


Paired Liver Cancer Tissue 
Research Genetics RNA 
6004-T 


42.9 

1 


Breast ca. 
MDA-K 


5.6 


Paired Liver Tissue Research 
Genetics RNA 6004-N 


18.6 


Ovary 


1.4 


Paired Liver Cancer Tissue 
Research Genetics RNA 
600 5-T 


12.2 


Ovarian ca. 
OVCAR-3 


8.4 


Paired Liver Tissue Research 

/"I — a£_— Ti'VT A £ (\f\ C XT 

Genetics RNA 6005-N 


8.1 


Ovarian ca. 
OVCAR-4 


2.6 


Nonnal Bladder GENPAK 
061001 


33.7 


Ovarian ca. 
OYCAR-5 


44.9 


Bladder Cancer Research 
Genetics RNA 1023 


5.7 


Ovarian ca. 
OVCAR-8 


5.0 


Bladder Cancer 
INVITROGEN A302173 


23.5 


Ovarian ca. 
IGROV-1 


5.2 


87071 Bladder Cancer 
(OD04718-01) 


69.7 


Ovarian ca."* (ascites) 
SK-OV-3 


50.1 


87072 Bladder Normal 
Adjacent (OD047 18-03) 


20.6 


Uterus 


0.7 


Nonnal Ovary Res. Gen. 


2.9 


Plancenta 


0.0 


Ovarian Cancer GENPAK. 
064008 


18.6 
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prostate 


0.4 


87492 Ovary Cancer j 53.2 
(OD04768-07) ! 


prostate ca * (bone 
met)PC-3 


1.9 


87493 Ovary NAT 
(OD04768-08) 


13.5 


Testis 


0.3 


Wonnal Stomach GENPAK 
P61017 


15.0 


Melanoma 
Hs68S(A).T 


2.1 


Gastric Cancer Clontcch 
9060358 


2.5 


Melanoma* (met) 
Hs688(B).T 


2.3 


NAT Stomach Clontech 
9060359 


8.3 


Melanoma 
UACC-62 


5.7 


Gastric Cancer Clontcch 
906039a 


19.1 


Melanoma ! 5.6 
M14 I 


NAT Stomach Clontech 
9060394 


4.C 


Melanoma j 12.5 
LOXIMV1 


G astric Cancer Clontcch 
9060397 


29.5 


Melanoma* (met) SK- 
MEL-5 


11.3 


NAT Stomach Clontcch 
9060396 


3.7 


Adipose 


32 


Gastric Cancer GENPAK 
064005 


9.9 



Table 31. Panels 3D and 4D 



PANEL 3D J 


jPANEL4D 




Tissue Name 


ReLExpr.,% 
3dx4tm5137f 
_ag673_bl 


Tissue Name 


Rel Expr., 

% 

4dto4833f 
ag673 


94905_DaoyJVleduUoblasto 
ma/Cerebelluni sscDNA 


13.6 


93768_Secondaiy Thl_anti- 
CD28/anti-CD3 


11.9 


94906jre671JMediilloblast 
OTn/CeTebelhim sscDNA 


11.6 


93769_Secondaty Th2_anti- 
CD28/anti-CD3 


8.2 


94907_D283 

Mcd_Medulloblastotna/Cere 
bcllum sscDNA 


53.3 


93770_Secondaiy Trl_anti- 
CD28/anti-CD3 


6.6 
0.0 


94908 JPFSK-lJ?rmritive 
Nenroectodermal/Cerebellum 
sscDNA 


9.3 


93573_Secondary Thl_resting 
day 4-6 in EL-2 


94909 XF- 

498 CNS sscDNA 


7,8 


93572_Secondaiy Th2_resung 
day4-6inIL-2 


0.8 


94910JSNB- 

78^CNS/glioma_sscDNA 


8.6 


93571_Secondary Trl resting 
day4-6inIL-2 


0.5 


9491 1_SF- 

268 CNS/glioblastonia sscD 
N r A 


14.1 


93568 ^primary Thl_anti- 
CD28/anti-CD3 


17.7 


949 1 2 JT98G_Glioblastoma_ 
sscDNA 


15.3 


93569_primary Th2jmti- 
CD28/anti-CD3 


9.1 


9677G_J>K-N- 

SH Neuroblastoma 


15.1 


93570_priniaiy Trl_anti- 
CD28/anti-CD3 


16.3 
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(rnetastasis)_sscDNA 


r n 






949l3_SF- 

295_CNS/g2ioblastoma_sscD 
NA 




93o6D_pnmatyThl resting dy 
^inIL-2 


7.9 

2.6 


94914_Cerebeltum_sscDNA 


5.5 

1 


93566jDrhnaryTh2 resting dy 
k-6mIL-2 


96777 jCferebelliiin_sscDNA 


0.9 


93567jprimary Trl_resting dy 
4-6 in IL-2 


4.3 


94916_NCI- 

H292_Mucoepidermoid lung 
carcinoma sscDNA 


36.5 


9335t_CD45RACD4 
lymphocyte anti-CD28/anti- 
CD3 

93352_CD45RO CD4 
lymphocyte anti-CD28/anti- 
CD3 


13.4 ; 
¥.9 ~ 
12.9 


94917_DMS-1 14_Small cell 
ftxng cancer_sscDNA 


S.I 


9491 8_DMS-79_Small cell 
lung 

cancei/neuioenctocrine sscD 
NA 


100.0 


93251_CD8 

Lymphocytes anti-CD28/anti- 
CD3 


949l9_NCI-H146JSmall cell 
lung 

caucer/neuroendocrine sscD 
NA 


32.1 


93353_chrojiic CDS 
Lymphocytes 2ry_rc sting dy 4- 
6inlL-2 


10.5 


94920_NCI-H526_Small cell 
lung 

c anc cr/ neuroendocrine sscD 
NA 


30.0 


93574_clnonic CD8 
Lymphocytes 2ry activated 
CD3/CD28 


5.2 


94921_NCI-N417_Smalt cell 
lung 

cancer/neuroendocrine_sscD 
NA 


30.6 


93354_CD4_none 


1.4 


94923 NCI-HS2 Small cell J 22.6 
lung ( 
cancer/neuroendocrine sscD j 
NA 1 


93252 Secondaty 
TM/rh2/rrl_anti-CD95 CHI I 


0.8 


94924_NCI- 

H157_Squamous cell lung 
cancer (metastasis)_sscDNA 


67.1 


93103_LAK cells_resting 


18.0 


94925JSO-H1 155_Large 
cell lung 

cane cr/ncurocndocri ne sscD 
NA 


19.8 


93788_LAKcells_IL-2 


7.8 


94926J\ 1 CI-H1299_Large 
cell lung 

cancet/neuroeiidociine sscD 
NA 


36.3 


93787_LAK cells_IL-2+IL-12 


8.1 


94927JS - CI-H727_Lung 
carcinoid sscDNA 


20.2 


93789_LAK cells JL-2+IFN 
gamma 


13.6 


94928JvQ-UMC-l l_Lung 
carcinoid sscDNA 


73.3 


93790JLAK cells_IL-2+ IL-18 


10.9 
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94929_LX-]_Small cell lung 
caneer_sscDNA 


20.2 


93104_LAK 

celIs_PMA/ionomycin and 1L- 

to 

18 


4.8 


94930_Colo-205_Colou 
canccr_sscDNA 


11.6 


93578 NK Cells IL-2j:estmg 


1.7 


94931 JCM12._Colon 
cancer sscDNA 


31.6 


93109JVlixcd Lymphocyte 
Reaction_Two Way MLR 


13.0 


94932J£M20L2_Colon 
cancer sscDNA 


8.5 


9311 0_Mixed Lymphocyte 
ReactionTwo Way MLR. 


11.4 


94933__NCI-H716_Colon 
cancer sscDNA 


36.9 


93111 Mixed Lymphocyte 
ReactionJTwo Way MLR 


3.2 


94935_SW-48_Colon 
adenocarcinoma sscDNA 


12. 5 |93 1 1 2_Mononuclear Cells 


1.4 


94936_SW1116_Colon 
adenocarcinoma sscDNA 


9.0 (93 J 13 Mononuclear Cell's 
j(FBMCs)_PWM 


23.7 


94937_LS I74TColon 
adenocarcinoma sscDNA 


32.6 33114 Mononuclear Cells 
'(PBMCs) PHA-T. 


8.7 


9493SJSW-948_Cokm 
adenocarcinoma sscDNA 


1 J 


93249JUrm>s (B cell)_none 


19.9 


94939_SW-480_Colon 
adenocarcinoma sscDNA 


6.7 


93250_Ramos (B 
cell) ionomycin 


100.0 


94940 NCl-SN U-5_Gastric 
carcinoma sscDNA 


12.4 


93349J* lymphocytea^PWM 


65.1 


94941_KATO in_Gastric 
carcinoma sscDNA 


55.9 


93350JB lymphoytes_CD40L 
and IL-4 


6.0 


94943 JNCI-SNU-16_Gastric 
carcinoma_sscDNA 


12.2 


92665_EOL-l 

(Eosinophil)_dbcAMP 

differentiated 


12.5 


94944_NCI-SNU-1_Gastric 
caicinoma_sscDNA 


58.1 


93248_EOL-1 

(Eosinophil^dbcAMP^MAio 
nomycin 


4.4 


94946_RF-l_Gastric 
adenocaicinoma_sscDNA 


11.4 


93356JDendritic Cellsjione 


9.5 


94947 JO^JJastric 
adenocaicinomasscDNA 
96778_MKN-45_Gastric 
carcinoma sscDNA 


15.3 


93355JDendritic Cells_LPS 
100 ng/ml 


8.7 


33.2 


93775 Dendritic Cells anti- 
CD40 


12.2 


94949_NC1-N87_Gastric 
carcinoma sscDNA 


16.7 


93 774_Monocytes_resting 


14.8 


9495 3 J3VCAR-5 ^Ovarian 
carcinoma sscDNA 


10.0 


93776 JVlonocytes_LPS 50 
ngfwi 


15.5 


94952_RL95-2_Uterine 
carcinoma sscDNA 


7.6 


93581_Macrophagesj:esting 


35.4 


94953 JJdaS3J>rvical 
adenocarcinoma sscDNA 


11.1 


93582 JVIaciophagesJ-PS 100 
agfaA 


4.4 


94954_C a SkLCervical 
epidermoid carcinoma 
(metastasis) sscDNA 


28.9 


93098 JKUVEC 
(Endothelial)_none 


27.0 


94955_ES-2_Ovarian clear 


15.5 


93099_HUVEC 


39.2 
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cell carcinoma_sscDNA 




(Endotiielial)_starved 




94957J$amos/6h stim_"; 
Stimulated wilh 
PMA/ionomycin 6h sscDNA 


15.8 


93100JIUVEC 
(Endo4elial)JL-lb 


12.7 
17.1 


|94958_Ramos/?4h stitn_"; 
Stimulated with 

hnm it k r * 

OPMA/ionorriycin 
14h sscDNA 


12.4 


93779 JIUVEC 
(Endotheliai)_IFN gamma 


•94962_MEG-01_Chromc 
myelogenous leukemia 
(megokaryoblast)_sscDNA 


25.5 


93102_HUVEC 
(Endothelial)_TNF alpha + 
EFN gamma 


14.0 


94963_RajiJBurkitt , s 
lymphoma sscDNA 


8.4 


93101JKUVKC 
(EndothclialLTNF alpha t 1L4 


22.4 


949<M_Daudi_Bmtttfs 
Iymphoma_sscDNA 


29.1 


93781_HUVEC 
(bndothehaI)_lL-ll 


11.7 


94965 JJ266_B-cell 
plasmacytoma/myeloma ssc 
DNA 


8.0 


93583_Lung Microvascular 
Endothelial Cells jnone 


14.7 


94968_CA46J3irrkiU's 
lymphoma_sscDNA 


72 


93584_Luiig Microvascular 
Endothelial CclIsJTNFa (4 
ng/ml) and ILlb (1 ng/ral) 


192 


94970 JlLjnon-Hodgkin's B- 
ceD lymphoma_sscDNA 


10.7 


92662_Microvascular Dermal 
endothelium none 


26.8 


94972 JTMljpre-B-cell 
lymphoma/leukemia sscDN 
A 


5,5 


92663_Microsvasular Dermal 
endotheliran_TNfa (4 ng/ml) 
and ILlb (1 ng/ml) 


182 


94973_Jurkat_Tcell 
leukemiajsscDNA . 


15.5 


93773 JBronchial 
epithelium TM«a (4 ng/ml) 
and ILlb (1 ng/ml) ** 


2.4 


94974JTF- 

1 EtythroleukemiasscDNA 


33.9 


93347_SmalI Airway 
Epitbelium_none 


8.7 


94975 jaUT7S_T-cell 
rymphoma_sscDNA 


23.7 


93348_Small Airway 
Epithelium TNFa (4 ng/ml) 
and ILlb {1 ng/mJ) 


37.9 


94977_U937_Histiocytic 
l>TOphoma_sscDNA 


29.4 


92668_Coronery Artery 
SMC resting 


17.4 


94980_KU- 

8 1 2_Myelogenous 

■ 1 ■ T\\T A 

leukemia sscDNA 


27.2 


92669_Coronery Artery 
SMCJTNFa (4 ng/ml) and 
EL lb (1 ng/ml) 


7.4 


94981_769-P_Clearcell 
renal carcinoma sscDNA 


173 


931 07_astrocyte3_resting 


13.1 


949S3_Caki-2_Clear cell 

icllal CaTOinoma SSC»lJlN/v 


19.9 


93 108_astrocyres_TNFa (4 
iig/mij ana ilid ^i ng/mij 


9.3 


94984_3W 839_Clearcell 
renal carcmomajsscDNA 


15.4 


92666_KU~812 
(Basophil)_resting 


13,0 


94986J340i_Wilms , 
tumor sscDNA 


19.5 


92667 JCU-8 12 
(Basophil) PMA/ionoycin 


26.8 


94987_Hs766T_Pancreatic 
carcinoma (LN 


12.7 


93579JXD1106 
(KeraLiriocytes)_nojie 


14.9 
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metastasis)_sscDNA 








94988_CAPAN-1 - Pancreatic 
adenocarcinoma (liver 
metastasis) sscDNA 




93580_CCD1106 
(Keratmocytes) TNFa and 
IFNg ** 


L4 


94989 JiU86.86_Pancreatic 
carcinoma (liver 
metastasis)_sscDNA 


38.5 


93791_Liver CiTrhosis 


2.3 


94990J3xPC-3JPancreatic 
adenocarcinoma sscDNA 


82 


93792_Lupus Kidney 


L0 j 


94991 JIPAC_Pancreatic 
adenocarcinoma jsscDNA 
94992 JvOAPaCa- 
2_Pancreatic 
carcinoma sscDNA 


24.3 


93577_Na-H292 


12.3 


5.6 


93358_NCI-H292_IL-4 


20.0 i 


94993 J^PAC-l^ancreatic 
ductal 

adenocarcinoma sscDNA 


322 


93360_NCI-H292_IL-9 


31.4 


94994 J>ANC-lJ>ancreatic 
epithelioid ductal 
carcinoma sscDNA 


29.0 


93359_NCHi292_IL-13 


13.7 1 


94996 JT24_Bladdcr 
carcimna (transitional 
cell)_sscDNA 


13.0 


93357 JNCI-H292JFN gamma 


152 


94997_5637J81adder 
carcinoma sscDNA 


18.4 


93777_HPAEC_- 


14.7 


94998_HT-1 1 97JBladder 
carcinoma sscDNA 


25.1 


93778_HPAEC_IL-1 
beta/TNA alpha 


12.7 


94999_UM-UC-3_Bladder 
carcimna (transitional 
cetl)_sscDNA 


3.3 


93254_Normal Human Lung 
FibroblasMione 


8.0 


95000_A204_Rhabdomynsar 
coma_sscDNA 


16.1 


93253_Normal Human Lung 
FibroblastJTNFa (4 ng/ml) and 
IL-lb(l ng/m0 


3.6 


95001 HT- 

1080 Fibrosarcoma sscDNA 


16.7 


93257_NoTtnal Human Lung 
Fibroblast IL-4 


18.7 


95Q02__MG- 
63_Ostco sarcoma 
(bone) sscDNA 


12.9 


93256_Normal Human Lung 
Fftroblast_IL-9 


13.5 


95003_SKrLMS~ 
1 Leiomyosarcoma 
(vulva)_sscDNA 


34.7 


93255_Normal Human Lung 
Fibroblast_IL-13 


10.2 


95004_SJRH30_RJaabaoniy a 
sarcoma (met to bone 
marrow)_sscDNA 


16.9 


93258_Normal Human Lrnig 
FibroblastlFN gamma 


1 1 J. 


95005_A43 ^Epidermoid 
carcinoma sscDNA 


9.4 


93 106 JDermal Fibroblasts 
CCD1070 resting 


39.0 


95007_WM266- 

4 Melanoma sscDNA 


6.9 


93361 JDermal Fibroblasts 
CCD1 070 TNF alpha 4 ng/ml 


47.0 
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95010_UU 145_Prostate 
carcinoma (brain 
metastasis) sscDNA 


0.3 


p3105_Dermal Fibroblasts 
CCD1070JL-1 beta 1 ng/m1 


15.7 


95012_MDA-MB- 
468JJreast 

adenocarcinoma sscDNA 


13.4 


03772_demial fibroWastJFN 
gamma 


5.0 


95013_SCC-4_Squamous 
cell carcinoma of 
tonguesscDNA 


0.1 


93771_dermal f!broblast_IL-4 


11.7 


95014_SCC-9_Squamous 
cell carcinoma of 
tongue sscDNA 


02 


93259JBD Colitis 1** 


0.1 


95015_SCC-15_Squamoufi 
cell carcinoma of 
tongue^sscDNA 


0.8 


93260JBD Colitis 2 


0.3 


95017_CAL 27_Squaracus 
cell carcinoma of 
tongue sscDNA 


29.0 


03261 JBDCrohns 


0.5 






735010_Colon_nonnal 


3.1 






735019_Lung_nonc 


7.1 






64028-l_Thymus_none 


9.4 






64030-1 JKidney_none [ 


3.0 



Panel 1.3D description: 77ie geneM0L9a is expressed in a ttwnber of tissues, including the central 
nervous system, lung, mammary gland and kidney. Moreover, its expression seems to be enhanced in 
Uanor ceU lines as compared to normal tissue in most cases with a good therapeutic window. 
5 Panel 2D description: Tissue distribution of this gene MOL9a in panel 2D confirms the result? obtained 
in panel 1.3 D. There is enhanced expression of this gene MOL9a in tumor tissue as against tlie normal 
adjacent tissue, pariiadarly in king and kidney cancer, but also in cases of colon cancer, ovarian cancer, 
breast cancer, gastric ccficer, bladder cancer, liver cancer and thyroid cancer. Some metastases, 
particularly die lung and those from melanoma express high levels vfMOL9u. Corroborative infon nation 
1 0 about the expression of this molecule is available in the form of ESTs, mostly from endothelial cells, 

colon, ovarian tumors, pancreas and brain regions. Panel 3D demonstrates increased expression of this 
gene MOL9a in a variety of carcinomas, supporting the results of panels L3D and 2D thus demonstraang 
utility for this protein as mi antibody target Therefore antibodies specific to this protein may be used as 
a therapeutic in the treatment of various types of cancer. 

15 

Panel 4D Description: This gene MOL9a is unregulated in endothelium, and epithelium 
regardless of stimulus. There is also high level expression of this protein in ionomycin-treated B 
cell line and mitogen (pokeweed mitogen, PWM) treated B cells. Consistent with this finding 
Peripheral blood mononuclear cells (PBMC) treated with PWM also demonstrate increased 
20 expression of this molecule. Further, induction of MOL9a is seen in activated T cells. In PBMC 
the T cell specific mitogen {Phytohcmaggluthiin, PHA) induces the expression of this transcript 
and in acute and chronically activated T cells the expression of this transcript is increased as 
compared to untreated or resting T cells. 
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The M0L9a is induced in activated B and T lymphocytes and may thus have a potential 
role in inflammation by regulating lymphocyte trafficking, or activation, or increasing tissue 
destruction- This molecule may also serve as a marker for activated T or B cells. 

Impact of Therapeutic Targeting of MOL9a: Small molecule or antibody therapies to the 
5 molecule encoded by MOL9a may inhibit tissue damage due to T or B cell activation and the 
bioactive molecules produced by these cell types. These diseases would include asthma'allergy, 
colitis, Crohn's disease, lupus, and arthritis. Alternatively, protein therapeutics based on this 
molecule could serve as an adjuvant and help boost the effectiveness of vaccines or regulate 
immune status during organ transplant 1 95067 19_BJBXT may also serve as a marker for 
10 activated X cells and serve as a diagnostic tool in determining the extent of inflammation in 
autoimmune diseases such as asthma/allergy, colitis, Crohn's disease, lupus, and arthritis. 



MOL9b 

Expression of gene MOL9b was assessed using the primer-probe set Ag2458, described 
15 in Table 32. Results of the RTQ-PCR runs are shown in Tables 33 and 34. 



Table 32. Probe name: Ag2458 



! Primers 


Sequences 


Tm 


Length 


Start 
Position 


SEQID 
NO: 


j Forward 


5'-GTTGGTGGTTCCAOGATTTT-3' 


58.7 


20 


58 


100 


1 Probe 

i 

i Reverse 


TbT-5'- 

TGATGTCTCCTCTTCA-GGCAiVTGTCT- 
3'-TAMRA 


66.6 


26 


104 


101 


5'-CTGCCAGCCACAGTATAGGA-3' 


58.9~ 1 


20 


130 


102 



20 Table 33. Panels 1 JI> and 2D 



PANEL 1. 3D 






PANEL 2D 




Tissue Name 


Rel. Expr., 

% 

1.3dtm4270t 
__ag2458 


Rel. Expr., 

% 

1.3dx4tm54 
07t ag245E 
a2 


Tissue Name 


Rel. Expr., 
% 

2dtm4271t 
ag2458 


Liver 

adenocarcinoma 


33.9 


41.4 


Normal Colon GENPAK 
b61003 


62.4 


Pancreas 


U 


0.8 


83219 CC Well to Mod Diff 
(OD03866) 


16.8 


Pancreatic ca. 


6.9 


44.3 


83220 CC NAT (OR03866) 


11.7 
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Adrenal gland 


U.7 


I.I 


S322T CC Ctt.2 rectosigmoid 
(OD03868) 


in/ 
1 ft.O 


Inyroid. 




J.J 


83222 LL MAI (UUUiooo) 


Q A 
0.4 


Salivary gland 


U.o 


2SJ 


83235 CC Mod Ditl 
(ODO3920) 


34.2 


Pituitary gland 


1 * 


l\ A 

0.4 


83236 CC NAT (ODO3920) 


11.3 


Brain (tetal) 


9.2 


19.7 


83237 CC Gr.2 ascend colon 
(OD03921) 


74.7 


Brain (whole) 


5.4 


17.7 


83238 CC NAT (OD03921) 


9.6 


Brain (amygdala) 


6.7 


9.0 


83241 CC from Partial 
Hepatectomy (ODO4309) 


35.6 


Brain (cerebellum) 
Brain 

(hippocampus) 


2.7 


10.2 


83242 Liver NAT 
(ODO4309) 


32.3 




87472 Colon niets to lung 
(OD04451-01) 


29.7 


Brain (wibslantia 
nigra) 


1.6 


7.2 


S7473 Lung NAT 
(OD04451-02) 


C.4 


Brain (thalamus) 


4.9 


16.6 


Normal Prostate Clontooh 
A+ 6546-1 


8.8 


Cerebral Cortex 


26.2 


8.2 


84140 Prostate Cancer 
(OD04410) 


25.9 


Spinal cord 


2.4 


9.8 


84141 Prostate NAT 
(OD04410) 


27.9 


CNSca.(glfcw'astro) 
U87-MG 


11.0 


19.7 


87073 Prostate Cancer 
(OD04720-01) 


15.8 


CNSca. 

(glio/astro) U-118- 
MG 


45.1 


82.4 


87074 Prostate NAT 
(OD04720-02) 


28.1 


CNS ca. (astro) 
SW1783 


21.9 


46.0 


Normal TjingGFNPAK 
061010 


24.8 


CNS ca.* (nemo; 
met)SK-N~AS 


73.7 


40.2 


83239 Lung Met to Muscle 
(OD04286) 


100.0 


CNS ca. (astro) 
SF-539 


9.9 


12.0 


83240 Muscle NAT 
(OD04286) 


31.9 


CNS ca. (astro) 
SNB-75 


22.5 


71.5 


84136 Lung Malignant 
Cancer (OD03 126) 


29.5 


CNS ca. (glio) 
SNB-19 


6.2 


18.1 


841 3 7 Lung NAT 
(OD03126) 


27.9 


CNS ca> (glio) 

U251 


5.9 


45.0 


84871 Lung Cancer 

(f~\T\<\A A f\A\ 


71.7 
15.1 


CNS ca. (glio) 
SF-295 


13.0 


26.4 


84872 Lung NAT 
(OD04404) 


Heart (fetel) 


1.7 


0.0 


84875 Lung Cancer 
(OD04565) 


28.5 


Heart 


1.2 


4.4 


84876 Lung NAT 
(OD04565) 


13.1 
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Fetal Skeletal 


11.4 


12 


85950 Lung Cancer 

(OD 04237-01; 


90.1 


Skeletal muscle 


3.8 


41.7 


85970 Lung NAT 
(OD04237-02) 


16.8 


Bone marrow 


1.5 


2.2 


83255 Ocular Mel Met to 
Liver (ODO4310) 


76.3 | 


Thymus 


0.8 


0.4 


83256 Liver NAT 
(ODO4310) 


26.8 


Spleen 


0.9 


0.8 


841 39 Melanoma Mets to 
Lung (OD04321) 


36.3 1 


Lymph node 


1.3 


10.4 


84I38 Lung NAT 
(OD04321) 


22.5 1 

i 

1 


Colorectal 


4.8 


3.1 


Normal Kidney GENPAK 
061008 


33.7 


Stomach 


0.0 


3.1 


83786 Kidney Ca, Nuclear 
grade 2 (OD04338) 


68.8 


Ismail intestine 

| 


1.1 


2.3 


83787 Kidney NAT 
(OD0433S) 


33.4 


Colon ca. 
SW480 


19.5 


18.9 


83788 Kidney Ca Nuclear 
grade 1/2 (OD04339) 


30.6 


Colon ca.*(SW4S0 
met)SW620 


29.9 


29.8 


83789 Kidney NAT 
(OD043391 


27.2 


Colon ca. 
HT29 


17.7 


9.7 


83790 Kidney Ca, Clear cell 
type (OD04340) 


49.3 


Colon ca. 
HCT-116 


22.1 


43.8 


83791 Kidney NAT 
(OD04340) 


32.1 


Colon ca. 
CaCo-2 


13.5 


18.4 


83792 Kidney Ca, Nuclear 
grade3(OD04348) 


16.0 


83219 CC Well to 
Mod Diff 
(OD03866) 


10.6 


7.0 


83793 Kidney NAT 
(OD04348) 


35.4 


Colon ca. 
HCC-2998 


45.7 


21.1 


87474 Kidney Cancer 
(OD04622-01) 


13.2 


Gastric ca.* (liver 
met) XC1-N87 


38.4 


69.5 


87475 Kidney NAT 
(OD04622-03) 


4.0 


Bladder 


7.4 


13.6 


85973 Kidney Cancer 
(OD04450-01) 


48.6 


Trachea 


2.9 


3.0 


85974 Kidney NAT 
(OD04450-03) 


30.4 


Kidney 


1.3 


3.5 


Kidney Cancer Clontech 
8120607 


20.4 


Kidney (fetal) 


3.6 


3.9 


Kidney NAT Clontech 


7.3 


Renal ca. 
786-0 


10.0 


19.0 


Kidney Cancer Clontech 
8120613 


14.4 


Renal ca. 
A498 


29.7 


28.6 


Kidney NAT Clontech 
8120614 


8.8 


Renal ca. 
RXF393 


5.6 


53.0 


Kidney Cancer Clontech 
9010320 


12.2 
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fRj&nal ca. 
ACHN 


5.5 


13.3 


Kidney NAT Clontech 
yu 10321 


22.1 


Renal ca. 
UO-31 


21.6 


45.5 


Normal Uterus GENPAK 
UclUlo 


8.3 


Renal ca. 
TK-10 


20.4 


27.0 


Uterus Cancer GENPAK 
064011 


153 


Liver 


2.9 


2.4 


Normal Thyroid Clontech 
A+ 6570-1 


15.1 


Liver (fetal) 


3.8 


5.0 


Thyroid Cancer GENPAK 
064010 


33.0 


Liver ca. 

(hepatohlast) 

HepG2 


17.3 


28.2 


Thyroid Cancer 
INVITROGEN A302152 


21.6 


Lung 


1.9 


2.5 


Thyroid NAT 
INVITRCHjEN A302153 


14.4 


Lung (fetal) 


1.4 


5.6 


Normal Breast GENPAK 
061019 


332 


Lung ca. (small 
cell) LX-1 


11.8 


40.6 


84877 Breast Cancer 
(OD04566) 


44.8 


Lung ca. (small 
cell) NC1-H69 


31.4 


44.0 


85975 Breast Cancer 
(OD04D90-01) 


95.9 


Lung ca. (s.cell 
var.) SHP-77 


693 


90.5 


85976 Breast Cancer Mets 
(OD04590-03) 


61.1 


Lung ca. (large 
cell)NCI-H460 


9.9 


63.6 


87070 Breast Cancer 

■% r . . ■ //-\t\A A /* 1" ^ f\ C\ 

Metastasis (OD04655-05) 


38.4 


Lung ca. (non-sm. 
cell) A549 


20.9 


25.9 


GENPAK Breast Cancer 
064006 


332 


Lung ca. (non- 
s.cell) NCI-H23 


11.3 


10.7 


Breast Cancer Res. Gctl 
1024 


23.0 


Lung ca (non-s.cell) 
HOP-62 


. 


-T- 


Breasl Cancer Clontech 
91 00266 


33.4 


Lungca. (non-s.cl) 
NCI-H522 


29.1 i 30.0 


Breast NAT Clontech 
9100265 


19.5 


Umgca*(squam.) 
SW 900 


9.7 | 20.1 


Breast Cancer 
INVUKOGEN A209073 


47.0 


Lung ca. (squam.) 
NCI-H596 


9.5 [ 43.2 


Breast NAT INVTTROGEN 
A2090734 


37.6 


Mammary gland 


53 i 9.0 

i 


Normal Liver GENPAK 
061009 


15.5 


Breast ca * (pi. 
effusion) MCF-7 


14.3 


30.8 


Liver Cancer GENPAK 
064003 


14.0 


Breast ca * (pLef) 
MDA-MB-231 


42.3 


41.1 


Li ver Cancer Research 
Genetics RN A 1025 


20.4 


Breast ca * (pi. 
effiision) T47D 


9.6 


13.8 


Liver Cancer Research 
Genetics RNA 1026 


7.1 


Brcasl ca. 
BT-549 


100.0 


100.0 


Paiied Liver Cancer Tissue 
Research Genetics RNA 
6004-T 


24.8 


Breast ca. 

N /fT\ A -NT 


17.2 


8.8 


Paired Liver Tissue Research 1 8.0 
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jMDA-N 




[Genetics RNA 6004-N 




,Ovary 




0.9 jpaired Liver Cancer Tissue 
Research Genetics RNA 
'|6005-T 


ISA 


bvarian ca. 


11.3 


20.9 (Paired Liver Tissue Research 
(Genetics RNA 6005-N 


7.3 


Ovarian ca: 

\JX\sAl\rH 


2.5 


10.6 


Normal Bladder GENPAK 
061001 


59.9 


Ovarian ca. 

\J V wYtvO 


20.0 


26.3 


Bladder Lancer Research 
Genetics RNA 1023 


5.2 


Ovarian ca. 
OVCAR-8 


18.6 


16.5 


Bladder Cancer 
INVITROGliN A302173 


30.1 


Ovarian ca. 
IGROV-L 


6.9 


6.8 


87071 Bladder Cancer 
(OD0471S-01) 


65.5 


Ovarian ca.* 
(ascites) SK-OV-3 


32.3 


64.3 


87072 Bladder Normal 
Adjacent (OD04718-O3) 


18.4 


Uterus 


LS 


4.3 


Normal Ovary Res. Gen. 


5.1 


Planccnta 


2.2 


1.1 


Ovarian Cancer GKNPAK 
064008 


29.1 


Prostate 


2.0 


2.4 


87492 Ovary Cancer 
(OD04768-07) 


66.0 


Prostate ca.* (bone 
met) PC -3 


3.1 


3.2 


87493 Ovary NAT 
(OD04768-08) 


7.7 


Testis 


2.0 


1.5 


Normal Stomach GliNPAK 
061017 


16.4 


Melanoma 
ttsGooCAJ. 1 


4.6 


5.4 


Gastric Cancer Clnntech 
9060358 


3.5 


Melanoma* (met) 


2.4 


6.9 


NAT Stomach Clontech 
9060359 


10.5 
29.9~ 


Melanoma 


i.3 ! 12.6 


Gastric Cancer Clontech 
9060395 


Melanoma 

iVl 1 *fr 


5.9 


56.2 


NAT Stomach Clontech 
9060394 


12.9 


Melanoma 
LOXIMVI 


14.7 


8.1 


Gastric Cancer Clontech 
9060397 


42.3 


Melanoma* (met) 
SK-MEL-5 


16.2 


24.7 


NAT Stomach Clontech 
9060396 


6.2 


Adipose 


4.4 » 3.4 

i 


Gastric Cancer GENPAK 
064005 


31.0 



Table 34. Panels 3D and 4D 



PANEL 3D 




PANEL 4D 




Tissue Name 


Rel. Expr, 
% 

3dx4tm5121 
t ag2458 b 


Tissue Name 


Rel. Expr., 
% 

4dtm4272t 
ag2458 
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2 






94905 - Daoy_MeduUoblast 
oraa/CerebeUum_sscDNA 


13.6 


93768_Secondary Thljurti- 
CD28/anti-CD3 


14.5 


94906^TE671_MedullobIa 
stom/Cerebelhim sscDNA 


7.4 


93769_Secondary Hi2_anti- 
CD28/anti-CD3 


7.4 


94907JD283 

jMed^Medulloblastoiria/Cer 
ebellian sscDNA 


53.0 


93770_Secondary Trl_anti- 
CD28/anli-CD3 


9.5 


N908_PFSIC-l_Primitive 
NeuroectodetmaVCerebellu 
m sscDNA 


6.5 


93573_Secondaiy Thl jresting diiy 
4-6 in IL-2 


02 


94909 XF- 

498 CNS sscDNA 


6.8 


93572 Secondary Th2 jesting day 
4-6 in IL-2 


0.6 


94910 SNB- 

•78 CNS/glioma sscDNA 


9.8 


93571 Secondary Trl resting day 
4^6 in IL-2 


0.8 


,94911_SF- 

268_CNS/glioblastoma_ssc 
pNA 


102 


93568_primary Thl_anti- 
CD28/anti-CD3 


19.8 


94912J1^8G_Glioblastom 
a sscDNA 


15.7 


93569_priniary Th2 anti- 
CD28/anti-CD3 


13.0 


96776JSK-N- 
SH_Neuroblastoma 
(metastasis)^ sscDNA 


93570j>rimary Trl_anti- 
CD28/anti-CD3 


19.2 


94913_SF- 

295 CNS/gtioblastoma ssc 
DNA 


7.4 


93565jprimary Thljresting dy 4-6 
in IL-2 


8.8 


94914 Cerebellum sscDN 
A 


3.9 


93566 ^primary Th2_resting dy 4-6 
hi IL-2 


2.2 


96777 Cerebellum sscDN 
A 


0.8 


93567jmraary Trljresting dy 4-6 
in IL-2 


3.6 


94916_NCI- 
H292JVlucoepidermoid 
lung carcrnirma sscDNA 


46.9 


93351_CD45RACD4 
lyuaphocyte_anti-CD28/aiili-CD3 


19.5 


94917J)MS-114_STnall 
cell luiig cancer_sscDNA 


10.9 


93352_CD45ROCD4 
iyinpb^ytc_anti-CD28/anti-CD3 


17.7 


949l8JMS-79JSmall cell 
lung 

cancer/ueuroendocrine ssc 
DNA 


100.0 


93251_CD8 Lymphocytes_anti- 
CD28/anti-CD3 


9.3 


949I9_NCLH146_Small 
ceD lung 

cancer/neuToendocrtne_ssc 
DNA 


33.4 


93353_chronic CD8 Lymphocytes 
2ry_resting dy 4-6 in IL-2 


11.0 


94920_NCI-H526_Small 
cell lung 

cancer/neuroendocrine ssc 
DNA 


30.2 


93574_chronic CDS Lymphocytes 
2iyjictivated CD3/CD28 


5.5 


94921 J*CI-N417_SmalI 
cell lung 


26.1 


93354_CD4jione 


0.7 
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cancer/neurosEidocrine ssc 
DNA 








94923 JNa-H82_Smal1 
cell lung 

cancer/ncurocndocrine_ssc 
DNA 


28.4 


93252_Secondary 
TTil/Th2/Trl_aiiti-CD95 CHI 1 


0.9 


94924 NCI- 

H157JSquamous cell lung 
cancer 

(metastasis) sscDNA 


$8.0 


93 103_LAK. cells_resting 


15.8 


94925_NCJ-H1155J-arge 
cell lung 

canccr/neuroendocrine^ssc 
DNA 


31.3 


9378Sj^KceUsJL-2 


6.1 


94926 J«3-H1299JLarge 
cell lung 

cancetfneuroendocrine ssc 
DNA 


32.4 


93787_LAK ceDs_IL-2^IL-12 


8.8 


94927_NCI-H727_Lung 
carcinoid_sscDNA 


23.0 


93789_LAK cells_IL-2-IFN 
gamma 


11.7 


94928 JSO-UMC- 
ll_Lung 

carcinoid^sscDNA 


75.0 
~ T7.0~ " 


93790_LAK cells_IL-2-r IL-18 


132 


94929 JLX-l_SmaIl cell 
lung cancer sscDNA 


93104.LAK 

cells _PMAAonomycm and IL-18 


8.5 


94930_Colo-205_Colon 
cancer sscDNA 


11.1 


93578 JNK Cells IL-2_resting 


2.5 


94931_KM12_Colon 
cancer sscDNA , 


37.3 


93 1 09_Mixed Lymphocyte 
Reaction Two Way MLR 


17.0 


94932_KM20L2_Colon 
cancer sscDNA 


7.8 


931 lOJtfixed Lymphocyte 
Reaclion_T wo Way MLR 


10.2 


94933 jra*H7L6_Cblan 
cancer sscDNA 


32.3 


9311 l_Mixed Lymphucy te 
ReacbonJVo Way MLR 


7.4 


94935jSW-4S_Colon 
adenocarcinoma sscDNA 


7.8 


93 1 12_Mononnclear Cells 
(PBMCs)jresting 


2.4 


94936_SW1116_Colon 
adenocarcinoma sscDNA 


10.9 


931 ^Mononuclear Cells 
(PBMCs)_PWM 


23.7 


94937_LS 174T_Colon 
adenocarcinoma sscDNA 


29.3 


93114 Mononuclear Cells 
(PBMCs)_FHA-L 


9.6 


94938_SW-948_Colon 
adenocarcinoma sscDNA 


1.9 


93249_Ramos (B ceH)_none 


30.4 


94939JSW-4SCLColon 
adenocarcinoma sscDNA 


4.9 


93250J&amos {B ccll)_ionomycin 


100.0 


9494D_NCI-SNU- 
5J3astric 

carcinoma sscDNA 


8.3 


93349_B lymphocytes^PWM 


702 


9494i_KATO lOGasliic 
carcinoma sscDNA 


53.1 


93350 Brymphoytes CD40Land 
IL-4 


5.5 


94943JSO-SNU- 


7.3 


92665_EOL-l 


11.7 



204 



WO 01/81578 



PCT/US01/13S78 



16_Gastric 
carcinoma sscDNA 




(Jfciosinophil) - dbcAJMfc > 
differentiated 




94944_NCI-SNU- 
l_Gastric 

carcinoma sscDNA 


64.4 


93248_EOL-l 

(Eosinopliil)_dbcAMP/FMAiouoin 
yein 


6.3 


94946_RF-l_Gastric 
adenocarcinoma sscDNA 


11.4 


93356_Dendritic Cells_none 


12.4 


94947_RF-48_Gastric 
adenocarcinoma sscDNA 


15.4 


93355 Jtendritic CeIts_LPS 100 
ng'ml 


9.0 


96778 JVIKN-45_Gastric 
carcinoma sscDNA 


28.8 


93775_Dendritic Cells^anri-CD40 


12.5 


94949J^a-N87_Gastuc 
carcinoma sscDNA 


19.5 


93774_Mcmoc3i:es_resting 


15,2 


9495 1 J3VCAR-5 JDvorian 
carcinoma sscDNA 


11.7 


93776>lonocytesJ^S 50 ng/ml 


11.7 


94952 JU-95-2JJterine 
carcinoma sscDNA 


4.5 


9358 l_Macropliages_resting 


41.8 


94953_HelaS3_Cervical 
adenocarcinoma sscDNA 


11.3 


93582Jtfacmp1iagesJ.PS 100 
ngfrnl 


6.8 


94954_Ca Ski_Cervical 
epidermoid carcinoma 
(metasfasis)_sscDNA 


24.3 


93098JEUVEC 
(Endothclial)_none 


35.8 


94955 jaS-2_Ovarian clear 
cell carcmomasscDNA 


16.1 


93099_HUVEC 
(Endothelial), starved 


58.2 


94957Jtomos/6h sthnj'; 
Stimulated with 
PMAy'ionomycin 
6h sscDNA 


8.7 


93100 HUVEC (Endothelial) IL- 
lb 


16.5 


94958_Ramos/14h stimj; 
Stimulated with 
PMA/ionomycin 
14h sscDNA 


8.3 


93779_HUVEC (Endothelial) JFN 
gamma 


26.1 


94962_MEG-0 l_Chronic 
myelogenous leukemia 
(megokaryoblast)_ sscDNA 


22.1 


93102 JIUVEC 
(Endoihelial)JTNF alpha -r IFN 
gamma 


16.0 


94963 JXajiJiurkitfs 
lyinphomasscDNA 


8.3 


93101 IIUVEC 
(Endothelial) TNF alpha - IL4 


23.2 


94964_Daudi_Burkitfs 
lymphoma_sscDNA 


19.8 


93781 JHUVEC (Ehdothelial)_ILr 
11 


23.6 


94965 j;266J3-cell 
plasmacytoma/myeloma ss 
bDNA 


6.4 


93583_Limg Microvascular 
Endothelial CellBjione 


21.0 


94968_CA46_Burkitrs 
lymphoma_sscDKA 


6-5 


935 84_Lung Microvascular 
Endothelial Cells TNFa (4 ng/ml) 
and ILlb(l ng/ml) 


18.3 


^4970_RL_non-IIodgfciii's 
B-cell lymphomasscDNA 


9.0 


92662JVficiovascular Dermal 
endothelium none 


35.4 


94972 JMlj>re-B-cell 
lymphoma/leukemiasscD 


4.1 


92663_Microsvasular Deimal 
endothelium TNFa (4 ng/nxl) and 


20.0 
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NA 




ILlb (1 ng/ml) 




94973_Juikat_T cell 
leakeinia_sscDNA 


9.8 


93773_BroucWal 
epithdiamJTNFa (4 ng/ml) and 
ILlb (1 ng/mJ; v * 


14.9 


94974_TF- 

1 Erydatoleukeniia sscHN 
A 


31.5 


93347_Small Airway 
Bpimelium_none 


12. 


94975_HUT 78_T-cell 
lymphoma_sscDNA 


15.7 


93348_SmalI Airway 
Epithelium INra (4 ng/ml) and 
ILlb (1 ng/ml) 


45.4 


94977_U937_Histiocytic 
lymphomasscDNA 


30.5 


92668_Coronery Artery 
SMCjestmg 




94980_KU- 
8)2_Myelogcnous 
leukemia sscDNA 


21.7 


92669 Coroneiy Artery 
SMCJTNFa (4 ng/ml) and ILlb (1 
ng/tnJ) 


1 A ft 

10.8 


94981_769-P - Clear cell 
renal carcinoma sscDNA 


16.6 


93 1 07_asirocytes_restmg 


li.l 


94983_Caki-2_Clear cell 
renal carcinoma sscDNA 


16.8 


93 lOSjistrocytesJTNFa (4 ng/ml) 
and ILlb (1 ng/ml) 


10.5 

I 


94984_SW 839_Clear cell 
renal carcinoma sscDNA 


11.8 


92666_KU-812 (BasopM) jesting 


18.8 


94986_G401_Wihns l 
tumor sscDNA 


16.5 


92667JCU-812 
(Basophil)_PMMonoycin 


27.9 

i 


94987JHs766TJ>ancreatic 
carcinoma (LN 
metastasis)_sscDNA 


12.9 


93579_CCD1106 
(Keratmocytes)_none 


20.9 


9498S_CAPAN- 
l_rancrcatic 
adenocarcinoma (liver 
metastasis)_sscDNA 


12.0 


935S0JXD1106 
(Kcratmoc>lBs)_TNFa and JFNg 


7.4 1 


949 89 jSU86.86_Pancreati 
c carcinoma (liver 
metastaais)_sscDNA 


32.6 


93 79 1 Liver CiTrnosis 


3.7 


94990JBxPC-3_PancTeauc 
adenocarcinoma sscDNA 


4.0 


93792_Lupus Kidney 


1.9 


94991JHPAC_Pancreatic 
adenocarcinoma sscDNA 


26.4 


93577_NCI-H292 


18.6 


94992_MIAPaCa- 
2_JPancreatic 
carcinoma sscDNA 


8.2 


93358_NCI-H292_IL-4 


33.7 


94993 CFPAC- 
1 JPancreatic ductal 
auenocarcmoma sscujna 


30.3 


93360_NCI-H292JL-9 


36.9 


94994 JPANC-l_Paiicrcatic 
epithelioid ductal 
carcinoma sscDNA 


26.7 , 


93359_Na-H292_IL-13 


20.0 


94996_T24_Bladder 
carcinma (transitional 
cell) sscDNA 


8.2 


93357_NCI-H292JFN gamma 


20.4 
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94997_:>637_Bladder 
carcinoma sscDNA 


20.5 


93777_HPAEC_- 


18.8 


94998JHT-1197_Bladder 
carcinoma sscDNA 


18.0 


93778_HPAEC_IL-1 beta/TNA 
alpha 


18.9 


94999 JJM-UC-3_Bladder 
caicinma (transitional 
cell)_sscDNA 


4.6 


93254_Nornial Human Lung 
Fibrobla3t_none 


9.5 


950C0_A204Jthabdomyos 
arconia_sscDNA 


20.0 


93253_Normal Human Lung 
FibroblastJTNFa (4 ng/ml) and IL- 
ib (1 ng/ml) 


3.7 j 

1 

1 


95001 JiT- 

1 080_FibTosarcoma_sscDN 
A 


15.4 


93257 Normal Unman Lung 
Fibroblast JL-A 


24.7 

i 


95002JVIG- 

63_Osteosarcoma 

(bone)_sscDNA 


16.2 


93256__Noraial Human Lung 
FibroblastJL-9 


192 ' 


95003JSK-LMS- 
^Leiomyosarcoma 
(vulva) sscDNA 


27.5 { 


93255_Normal Human Lnng 
FibroblastJL-13 


143 


9 5004_SJRH30_Rhabdom 
yosarcoma (met to bone 
mano\v)_sseDNA 


12.5 


93258_Normal Human Limg 
Fibtoblast_IFN gamma 


23.2 1 


950O5_A43 l_Epidcimoid 
carcinoma sscDNA 


6.9 


93106_Dennal Fibroblasts 
CCD1070_resting 


47.0 


95007JNM266- 
4_Melanoma_sscDNA 


7.5 


93361JDermal Fibroblasts 
CCD1 070_TNH alpha 4 ng/ml 


42.3 


95010_DU 145J>rostate 
carcinoma (brain 
metastasis)_sscDNA 


0.1 


93105_Demial Fibroblasts 
CCD1070JL-1 beta 1 ng/ml 


20.2 


95012JMDA-MB- 
468_Breast 

adenocarcinoma sscDNA 


13.5 


93772_dejrmal fibroblast,i>N 
gamma 


92 


950 1 3_SCC-4_Squainous 
cell carcinoma of 
tongue sscDNA 


1.3 


93771 jiermal fibroblast_JL-4 


22.1 


95G14_SCC-9JSquamous 
cell carcinoma of 
tongue_sscDNA 


0.3 


93259 JBD Colitis 1** 


1.4 


95015_SCC-1 5_Squainous 
cell carcinoma of 
tongue SSCUISA 


0.3 


9326QJBD Colitis 2 


1.1 


95017_CAL 27_Squamous 
ceD carcinoma of 
tonguc_sscDNA 


24.5 


93261 JBD Crohns 


1.0 






735010_Colon„normal 


4.1 






73501 9_Lung_none 


8.9 






64028-l_Thymus_none 


10.5 
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64030-l_Kidney_none 


3.6 



Panel L3D description: The gcneMOIDb is expressed in a number of tissues, including the central 
nervous system, lung, mammary gland and kidney. Moi*eover, its expression seems to be enhanced in 
tumor cell lines relative to normal tissue in most ernes with a good therapeutic window. 
5 Panel 2D description: Tissue distribution of the gene MOL9b m panel 2D confirms the results obtained in 
panel 1.3 D. There is enhanced expression of tliis gene (M0L9b) in tumor tissue relative to normal 
adjacent tissue, particularly in lung and kidney cancers, but also in cancers of colon, ovary, breast, 
gastric, bladder, liver and thyroid Some metastases, particularly htngjmtustasvs and those from 
melanoma express this gene M0L9b at particularly high levels. Corroborative information about the 
1 0 expression of thte molecule is available in the form ofESTs, mostly from endothelial cells, colon, ovarian 
tumors, pancreas and brain regions. Panel 3D for the gene MOL9h shows high level of expression in a 
variety of carcinomas, supporting results from panels L3D and 2D, and demonstrating utility for this 
protein as an antibody target. Tlierefore unUbvdies specific to this protein may be used as a therapeutic 
in the treatment of various types of cancer. 

15 

Panel 4D Description: The gene MOL9b is upreguiated in endothelium, and epithelium 
regardless of stimulus. There is up regulation of this molecule in an ionomycin treated B cell 
line and it is highly expressed in mitogen (PWM) treated B cells. Consistent with this finding, 
PBMCs treated wife PWM also up regulate this molecule. Induction of lliis gene MOL9b is also 
20 seen in activated T cells. In PBMC the T ceH specific mitogen PHA induces the expression of 
this transcript and in acute and chronically activated T cells the expression of this transcript is 
increased as compared to untreated or resting T ceils. This transcript is also expressed in resting 
macrophages. 

Potential Rolc(s) of in Inflammation: The molecule MOL9b is induced by B and T 
25 lymphocytes and may potentiate inflammation by regulating lymphocyte trafficking, or 
activation, or increasing tissue destruction. This molecule may also serve as a marker for 
activated T or B cells and may also be involved in the differentiation of monocytes into 
macrophages (see reference). Macrophages also participate in inflammation by producing 
multiple biologically active proteins like cytokines, activating other cells wilhin the local 
30 mkroenvironment, and ingesting dead and dying cells. 

Impact of Therapeutic Targeting of MOL9b; Small molecule or antibody therapies to the 
molecule encoded for by MOL9b may reduce or eliminate inflammation and tissue damage due 
to T or B cell activation or macrophages and the bioactive molecules produced by these cell 
types, These diseases would include asthma/allergy, colitis, Crohn's disease, lupus, and arthritis. 
35 Alternatively, protein therapeutics based on this molecule could serve as an adjuvant and help 
boost the effectiveness of vaccines or regulate immune status during organ transplant. MOL9b 
also serves as a marker for activated T and B cells and serve as a diagnostic tool for determining 
indirectly measuring the extent of inflammation due to autoimmune diseases which induce T or 
B cells activation such as asthma/allergy, colitis, Crohn's disease, lupus, and arthritis. Elevated 
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expression of flic gene MOL9b in macrophages may help in distinguishing resting macrophages 
from monocytes , dendritic cells. 

MOLlOa 

Expression of gene MOLlOa was assessed using the primer-probe set Agl 129. described 
5 in Table 35. Results of the RTQ-PCR runs are shown hx Table 36. 



Tabic 35. ProTic name: Ag1 129 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 
Position 


SEQID 
NO: 


Forward 


5'-CTAGACCAGCAGCTGGATGAT-3> 


59.S 


21 


1518 


103 


Probe 


TET-5'- 

CTACAGACCAAGTTTGCTCGCCTCCT- 
3'-TAMRA 


68.7 


26 


1539 


104 


| Reverse 


5'-CAATGCGGTAAGCAATCTTAAG-3' 


59 


22 


1587 


105 



10 Tabic 36. Panels 1.3D and 4D 



PANEL 13D 




PANEL 4D 




Tissue Name 


Rel. Expr., 
% 

1.3dx4tm57 
7 It agl 129 
a2 


Tissue Name 


Rel. Expr., 
% 

4Dbnl984t 
_agU29 


Liver 

adenocarcinoma 


3.9 


93768 Secondary Thl anti-CD28/anti- 
CD3 


0.0 


Pancreas 


3.7 


93769_Secondary Th2_anti-CD28/anti- 
CD3 


9.0 


Pancreatic ca. 
CAP AN 2 


2.8 


93770 Secondary Trl anti-CD28/anti- 
CD3 


9.0 


Adrenal gland 


1.8 


93573jSecondary Thl_resting day 4-6 
mIL-2 


0.0 


Thyroid 


3.2 


93572_Secondary Tb2_rcating day 4-6 
inIL-2 


0.0 


Salivary gland 


3.8 | 


93571 Secondary Trl resting day 4-6 in 
IL-2 


0.0 j 


Pituitary gland 


2.6 


93568_primary Thl_anti-CD28/anti- 
CD3 ~ 


0.0 


Brain (fetal) 


21.9 


93569jprimaryTh2 anti-CD2S/anti- 
CD3 


0.0 


Brain (whole) 


14.6 


93570 jprimary Trl_anti-CD28/anti- 
CD3 


8.9 



209 



WO 01/81578 PCTAJS01/13578 



Braiu (amygdala) 


13.0 


93565_primary Thl_resting dy 4-6 in 

IT T 


6.8 


Brain (cerebellum) 


1.8 


93566 jrimaiy Tb2_resting dy 4-6 in 

IT 1 


14.6 


Brain (hippocampus) 


13.7 


93567 jrimary Trl ^resting dy 4-6 in IL- 


0.0 


Brain (substantia 
nigra) 


21.4 


93351_CD45RA CD4 lymphocyte_anti- 


5.9 


Brain (thalamus) 


10.3 


93352_CD45RO CD4 tymphocyte_anti- 

V^i l_/Z>0; till U V^-L/J 


14.3 


Cerebral Cortex 


10.5 


93251_CD8 Lymphocytes_ant> 

^ LJJ. 0/ oil li-V^ 


33.0 


Spinal cord 


8.1 


93353_chromc CDS Lymphocytes 

£/ry resting ny h-o in 


42 


CNS ca. (giin/astro) 

Uo /rLVLLi 


6.0 


93574_chronic CD8 Lymphocytes 
zry_acuvaica luj/lu4.o 


0.0 


CNSca, (glio/astro) 

U-l Io-JyUj 


0.0 


93354 j:W_none 


1-4 


CNS ca. (astro) 


3.5 


93252_Secondaiy ThlyTh2/Trl_anti- 


~ 20.2" 


CNS ca * (neuro; met 

"\ QJT \T AC 


0.0 


93103JLAK. cells_resting 


8.7 


CNS ca. (astro) 


1.7 


93788_LAKcells_IL-2 


30.8 


CNS ca. (astro) 

\NH-/j 


0.0 


93787_LAK cells_IL-2+-IL-12 


15.6 


CNS ca. (glio) 
SNB-19 


0.9 


93789_LAK ceIls_IL-2+IFN gamma 




CNS ca. (glio) 
U25l 


1.4 


93790_LAK cells_IL-2+ IL-18 


20.6 


CNS ca. (glio) 
SF-295 


0.0 


93104 LAK cells PMAtfonomycm and 
IL-18 


0.0 


Heart (fetal) 


0.0 


93578_NK Cells IL-2_resting 


5.9 


Heart 


3.3 


931 09_Mixed Lymphocyte 

ivcaCTiOn^ 1 WO W ay lYJULJts. 


12.9 


Petal Skeletal 


2.0 


93110JVlixed Lymphocyte 
Keacuon i wo way-LVLkiv 


3.5 


Skeletal muscle 


2.3 


93 1 1 l_Mixed Lymphocyte 
jveacnon_iwo w ay ivlliv 


6.8 


Bone marrow 


4.0 


93 1 12_Mononuclear Cells 
IPBMCs) resting 


17.2 


Thymus 


3.9 


93113 Mononuclear Cells 
(PBMCs) PW 


32.5 


Spleen 


0.0 


93114 Mononuclear CeDs 
(PBMCs)J»HA-L 


20.3 


Lymph node 


1.7 


93249_Ramos (B cell)_none 


22.2 


Colorectal 


1.4 


93250_Ramos (B ccll)_ionomycin 


68.3 
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Stomach 


0.0 


93349_B lymphocytes_PWM 


33.7 


Small intestine 


0.0 


93350_B lympboytes_CD40L and IL-4 


9.4 


Colon ca. 
STW4R0 


1.5 


92665_EOL-l (Eosinophil)_dbcAMP 
differentiated 


15.3 


Colon ca.*(SW480 


2.0 


93248_EOL-l 


0.0 


Colon ca. 

WT9Q 


22 


93356_Dendritic Ceils_none 


0.0 


Culon ca. 

i -j i t> 


3.7 


93355 J3endritic Cells_LFS 100 ng/nil 


6.7 


Colon ca. 


0.6 


93775_Dendritic Cells_anti-CD40 


0.0 


$3219 CC Well to 
ModDiff{OD03866) 


5.2 


93774_M° n ocyte s Jrestin.g 


20.6 


Colon ca 
IICC-2998 


5.1 


93776 Monocytes LPS 50 net/ml 


15.8 


Gastric ca.* (Kver 
met)NCI-N87 


10.0 


9358 1 _Macrophages_resting 


15.9 


Bladder 


4.6 


93582_Macrophages - LPS 100 ng/ml 


64.2 


Trachea 


30.4 


93098_HUVEC (Endothelial)_none 


~ 3.0 


Kidney 


3.3 


93099 HUYEC (EndoifaeliaH starved 


0.0 


Kidney (fetal) 


7.3 


93100_HUVEC (Endothelial)JL-lb 


3.5 


Renal ca. 

/oD-U 


0.0 


93779_HUYEC (Endothelial)IFN 
gamma 


6.7 


Renal ca. 


0.0 


93102_HUVEC (Endothelial)_TKF 
ajpna *r it jn ^arnrna 


0.0 


Renal ca. 
RXF393 


4.4 


93101 JTOVEC (Endothelial)_TNF 
alpha + IL4 


8.1 


Renal ca. 
ACHN 


u.u 


/oi_riu vtu ^jincioiiieiiai < )_ii--i i 


A A 
U.U 


Renal ca. 
UO-31 


U.U 


yijoJ JtAing Mjcrovasciuar Endothelial 
Cells none 


O/l A 

Z4.U 


Renal ca. 


5.9 


93584JLung Microvascular Endotlielial 
(jeii£_ii\ira (4 ng/mij ana U-.1D 
ng/ral) 


17.8 


Liver 




yzooz_i>iicru vascular ucrmaj 
endothelium none 




Liver (fetal) 


0.0 


92663_Microsvasular Dermal 
endothclium_TNFa (4 ng/ml) and IL1 b 
(1 ng/ml) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


93773 Bronchial epithelium TNFa (4 
ng/ml) and ILlb (1 ng/ml) ** 


79.6 


Lung 


30.2 


93347_Small Airway Epithelium_none 


0.0 


Lung (letal) 


100.0 


93348_SmaU Airway EpitheliumJTOFa 
(4 ng/ml) and ILlb (1 ng/'ml) 


53.6 
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Lung ca. (small cell) 
La-1 


1.9 


92668_Coronery Artery SMC_resting 


0.0 


Lung ca. (small cell) 
NCI-H69 


9.8 


92669_Coronery Artery SMCJTNFa (4 
ngtal) and ILlb (3 TL&mL) 


0.0 | 


Lung ca. (sxell var.) 

OTTT4 

SHP-77 


9.5 


93 1 07_astrocytes_resting 


0.0 


Lung ca. (large 
cell)NCI-H46(> 


0.0 


93 108_astrocytes_TNFa (4 ng/ml) and 
ILlb (1 ngtal) 


19.6 


Lung ca. (non-sin. 
cell) A549 


i.5 


92666_KU-812 (BasophU)_resting 


0.0 


Lungca. (non-s.cell) 
NCI-H23 


1.8 


9266"7_KU-812 
(Basophil)_PMAyionoycm 


15.1 


Lungca (non-s.cell) 
HOP-62 


0.8 


93579JXD1 1 06 (Keratinocytes)_none 


0.0 


Lung ca. (non-s.cl) 
NC1-H522 


8.7 


935SO_CCD1106 

(Kerathiocytes)_TNFa and IFNg ** 


4.7 


Lung ca. (squam.) 
SW 900 


1.S 


93791JJver Cirrhosis 


47.6 


Lung ca. (squam.) 
NCI-H596 


6.7 


93792 JLupus Kidney 


25.0 


Mammary gland 


8.0 


93577_NCI-H292 


17.0 


Breast ca.*(pl. 
effusion) MCF-7 


0.0 


93358 J*CI-H292_IL-4 


4.8 


Breast ca* (pl.ef) 
MDA-MB-231 


0.0 


93360_NQ-H292^IL-9 


12.9 


Breast ca.* (pi. 
effusion) T47D 


0.0~ " 


p3359JNCI-H292JL-13 


0.0 


Breast ca. 
BT-549 


0.0 


933S7JNCI-m92 JFN gamma 


4.4 


Breast ca. 
MDA-N 


0.0 


93777JHPAEC / 


0.0 


Ovary 


2.3 


93778 JIPAECJL-1 beta/TNA alpha 


6.9 


Ovarian ca. 
OVCAR-3 


0.0 


93254_Nonnal Human Lung 
Fibroblast none 


0.0 


Ovarian ca. 
OVCAR-4 


1.2 


93253_Norrnal Human Lung 
Fibroblast JHSTa (4 ng/ml) and IL-lb (1 

l TV 

ng/ml) 


62 


Ovarian ca. 
OVCAR-5 


7.7 


93257_Nonnal Human Lung 
Fibroblast IL-4 


6.9 


Ovarian ca. 

UVLM-k 


0.0 


93256_Nonnal Human Lung 

T * , .-,.1.. ] 1 o i TT O 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


93255_Nomial Human Lung 
Fibroblast IL-13 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


bT~~ 


93258_Normal Human Lung 
Fibroblast IFN gamma 


0.0 


Uterus 


1.9 


93 l06_Deimal Fibroblasts 
CCD1070 resting 


12.9 
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Planccnta 


0.0 


93361 JDcrmal Fibroblasts 
CCT>1070JTNF alpha 4 ng/ml 


28.5 


Prostate 


4.9 


93105 Dermal Fibroblasts 
CCD1070JL-1 beta 1 ng/ml 


0.0 


Prostate ea.* (bone 
uiet)PC-3 


4.4 


93772 jdcrmal fibroblastJOFN gamma 


9.8 


Testis 


57.0 


93771 jiermal SbroblastJL-4 


2.6 


Melanoma 


0.0 


93259 JBD Colitis 1** 


88.3 


Melanoma* (met) 


0.0 


93260JBD Colitis 2 


0.0 


Melanoma 

IT ACT 1 AO 


0.0 


93261 JBDCrohns 


0.0 


Melanoma 
M14 


0.0 


73501 O^Colonnorniai 


20.2 


Melanoma 
LOXIMVI 


3.6 


73501 9_Lung_none 


29.7 


Melanoma* (met) 
SK-MEL-5 


3.3 


64028- IJIliymus^none 


100.0 


Adipose 


3.2 


64030O_Kidney_none 


10.0 



The gene MOLlOa show high levels in the testis and in fetal lung in panel 1 .3D. This 
indicates that this gene may be used for regeneration therapy in the lung and may also play a role 
in male fertility. The profile in panel 4D shows high expression in thymus with low to 
5 undetectable expression in other tissues and cell lines (Ct values >35). Therefore this gene may 
be involved in T-cell development and may be a marker for immature T cells. 



EXAMPLE 2: SOT ANALYSIS OF MOL6A 

Variant sequences are included in this application. A variant sequence can include a 
10 single nucleotide polymorphism (SNP). A SNP can, in some instances, be referred to as a 

VSNP" to denote that the nucleotide sequence containing the SNP originates as a cDNA. A SNT 
can arise in several ways. For example, a SNP may be due to a substitution of one nucleotide for 
another at the polymorphic site. Such a substitution can be either a transition or a transversion. A 
SNP can also arise from a deletion of a nucleotide or an insertion of a nucleotide, relative to a 
15 reference allele. In (his ca&e, the polymorphic site is a site at which one allele bears a gap with 
respect to a particular nucleotide in another allele. SNPs occurring within genes may result m an 
alteration of the amino acid encoded by the gene at the position of the SNP. Intragenic SNPs 
may also be silent, however, in the case that a codon including a SNP encodes the same amino 
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acid as a result of the redundancy of the genetic oode. SNPs occurring outside the region of a 
gene, or in an intron within a gene, do not result in changes in any amino acid sequence of a 
protein but may result in altered regulation of the expression pattern for example, alteration in 
temporal expression, physiological response regulation, cell type expression regulation, intensity 
5 of expression, stability of transcribed message. 

Method of novel SNP Identification: SNPs are identified by analyzing sequence 
assemblies using CuraGen's proprietary SNPTool algorithm, SNPTool identifies variation in 
assemblies with the following criteria: SNPs are not analyzed within 10 base pairs on both ends 
of an alignment; Window si2c (number of bases in a view) is 10; Hie allowed number of 

10 mismatches in a window is 2; Minimum SNP base quality (PHRED score) is 23; Minimum 

number of changes to score an SNP is 2/assembly position. SNPTool analyzes the assembly and 
displays SNP positions, associated individual variant sequences in the assembly, the depth of the 
assembly at that given position, Hie putative assembly allele frequency, and the SNP sequence 
variation. Sequence traces are then selected and brought into view for manual validation. The 

1 5 consensus assembly sequence is imported into CuraTools along with variant sequence changes to 
identify potential amino acid changes resulting from the SNP sequence variation. 
Comprehensive SNP data analysis is then exported into the SNPCalling database. 

Method of novel SNP Confirmation: SNPs are confirmed employing a validated 
method know as Pyrosequencing (See Aldeibom el al. Determination of Single Nucleotide 

20 Polymorphisms by Real-time Pyrophosphate DNA Sequencing, (2000). Genome Research. 10, 
Issue 8, August. 1249-1265). In brief, Pyrosequencing is a real time primer extension process of 
genotyping. This protocol takes double-stranded, biotinylatcd PCR products from genomic 
DNA samples and binds them to streptavidin beads. These beads are then denatured producing 
single stranded bound DNA. SNPs are characterized utilizing a technique based on an indirect 

25 bioluminometric assay of pyrophosphate (PP,) that is released from each dNTP upon DNA chain 
elongation Following Klenow polymerase-mediated base incorporation, PPi is released and used 
as a substrate, together with adenosine 5-phosphosulfate (APS), for ATP sulfiirylase, which 
results in the formation of ATP. Subsequently, the ATP accomplishes the conversion of luciferin 
to its oxi-derivative by the action of luciferase. The ensuing light output becomes proportional to 

30 the number of added bases, up to about four bases. To allow processivity of the method dNT P 
excess is degraded by apyiase, which is also present in the starting reaction mixture, so that only 
dNTPs are added to the template during the sequencing. The process has been fully automated 
and adapted to a 96-well format, which allows rapid screening of large SNP panels. 
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The DNA and protein sequences for the novel single nucleotide polymorphic variants of 
the Trypsin -like gene of MOL6a (GM87760758__A) are reported in Table 37. Variants are 
Teported individually "but any combination of all or a select subset of variants are also included. 
In Table 37, the positions of the variant bases and the variant amino acid residues are underlined. 
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Table 37. 

A. Variant 13373750 of MOL6a nucleotide sequence (SEQ ID NO. 11). 

CtoA at position 360. 

B. Nucleotide sequence of variant at position 360. 

5 lTACCATG&AATATGTCTTCTAT^ 

SlCTSCTCCCTm'TCSTGTACCTC;^ 
ISlCa^GCTCAC^TGCTATTTACC^^ 

2 4 1AAFTAACCC CAT TCAGATCGT CCGCT AC TGGAACTACAGT C ATAGOGCCCCACAGGATGAC CT CATGCTCATC&AGCTGG 

3 2 1 CTAAGC CTOCCATGCTC AATCCC A A AGT CCAGCC CCT7ACC CTCGCC ACCACC AATCTCftGG C CAGOCACTGT CTGT CCA 
10 40 lCrCT CAGGTTTGGACZXSC^GCCAAGAAAACAG 

4 8 1TGATCGAGAATGCCAAAAAAACAGAACAAGGAAAAAGCCACAG3AATTCCTTO 

5 6 1 QGAATT TTTGGGGAGOTGGCCGTTGCTACTGT CATCTGCAA AGACAAGCTCCAGGGAATCGAGG'TGGGG CACTTCATGGG 
641^3C^CGTC^CATC^CACCAATGTTTAa^^ 

721A (SEQ ID KTO-IOS) 

15 



C. Protein Sequence of variant at position 119, 



8 lNPIQIVRYWinrSHSAPODDLMLI KLAKP2\*njNKTC\TQPLTIA , rrNVRPGTV CIiL SGIiD WSQENSGRHPEIjR^JIjEAPWISD 
lfflRECEKttRTSX^QRF^MCEXra^ [SEQ ID 

20 HO. 107) 



D. Effect of variant on amino add residue 
Pro to Thr 



25 

EQUIVALENTS 

Although particular embodiments have been disclosed herein in detail, this has been done 
by way of example for purposes of illustration only, and is not intended to be limiting with 
respect to the scope of the appended claims, which follow. In particular, it is contemplated by 

30 the inventors that various substitutions, alterations, and modifications may be made to the 

invention without departing from the spirit and scope of the invention as defined by the claims. 
The choice of nucleic acid starting material, clone of interest, or library type is believed to be a 
matter of routine for a person of ordinary skill in the art with knowledge of the embodiments 
described herein. Other aspects, advantages, and modifications considered to be within the scope 

35 of the following claims. 
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WHAT IS CLAIMED TS: 

1 . An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

(a) a mature form of an amino acid sequence selected from the group consisting of 
SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18 9 20, 22, 24, 26, 28, and 30; 

(b) a variant of a mature form of an amino acid sequence selected from the group 
consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 
30, wherein one or more amino acid residues in said variant differs from the 
ammo acid sequence of said mature form, provided that said variant differs in no 
more than 15% of the ammo acid residues from the amino acid sequence of said 
mature form; 

(c) an amino acid sequence selected from the group consisting of SEQ ID NOS:2, 4, 
6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; and 

(d) a variant of an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, wherein one or more 
amino acid residues in said variant differs from the amino acid sequence of said 
mature form, provided that said variant differs in no more than 1 5% of aifrino acid 
residues from said amino acid sequence. 

2 The polypeptide of claim 1 , wherein said polypeptide comprises the amino add sequence 
of a naturally-occuiring allelic variant of an amino acid sequence selected from the group 
consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30. 

3, The polypeptide of claim 2, wherein said allelic variant comprises an amino acid 
sequence that is the translation of a nucleic acid sequence differing by a single nucleotide 
from a nucleic acid sequence selected from the group consisting of SEQ ID NOS:l, 3, 5, 
7, 9, 1 1, 13, 15, 17, 19, 2L 23, 25, 27, and 29. 

4. The polypeptide of claim 1 , wherein the amino acid sequence of said variant comprises a 
conservative amino acid substitution. 
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5. An isolated nucleic acid molecule comprising a nucleic acid sequence encoding a 
polypeptide comprising an amino acid sequence selected from the group consisting of: 

(a) a mature form of an amino acid sequence selected from the group consisting of 
SEQ ID NOS:2 s 4, 6, 8, 10, 12, 14, 16, 18, 20 , 22, 24, 26, 28, and 30; 

(b) a variant of a mature form of an amino acid sequence selected from the group 
consisting of SEQ IDNOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 
30, wherein one or more amino acid residues in said variant differs from the 
amino acid sequence of said mature form, provided that said variant differs in no 
more than 1 5% of the amino acid residues from the amino acid sequence of said 
mature form; 

(c) an amino acid sequence selected from the group consisting of SEQ ID NOS:2, 4, 
6, 8, 10, 12, 14, 16, 18, 20 : 22, 24 : 26, 28, and 30; 

(d) a variant of an ammo acid sequence selected from the group consisting SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, wherein one or more 
amino acid residues in said variant differs from the amino acid sequence of said 
mature form, provided that said variant differs in no more than 1 5% of amino acid ' 
residues from said amino acid sequence; 

(e) a nucleic acid fragment encoding at least a portion of a polypeptide comprising an 
amino acid sequence chosen from the group consisting of SEQ ED NOS:2, 4, 6, 8, 
10, 12, 14, 16, 18, 20,22,24, 26, 28, and 30, ora variant of said polypeptide, 
wherein one or more amino acid residues in said variant differs from the amino 
acid sequence of said mature form, provided that said vmmt differs in no more 
than 15% of amino acid residues from said amino acid sequence; and 

(f) a nucleic acid molecule comprising the complement of (a), (b), (c), (d) or (e). 

6. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule comprises the 
nucleotide sequence of a natumlly-occumng allelic nucleic acid variant. 

7. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule encodes a 
polypeptide comprising the amino acid sequence of a naturalfy-occurring polypeptide 
variant. 
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8. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule differs by a 
single nucleotide from a nucleic acid sequence selected from the group consisting of SEQ 
IDNOS:!, 3, 5, 7, 9 f 11, 13, 15, 17, 19,21, 23, 25, 27, and 29. 

9. The nucleic acid molecule of claim 5, wherein said nucleic acid molecule comprises a 
nucleotide sequence selected from the group consisting o£ 

(a) a nucleotide sequence selected from the group consisting of SEQ ID NOSrl, 3, 5, 
7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29; 

(b) a nucleotide sequence differing by one or more nucleotides from a nucleotide 
sequence selected from the group consisting of SEQ ID NOSrl, 3, 5, 7, 9, 1 1, 13, 
15, 17, 19, 21. 23, 25, 27, and 29, provided that no more than 20% of the 
nucleotides differ from said nucleotide sequence; 

(c) a nucleic acid fragment of (a); and 
• (d) a nucleic acid fragment of (b). 

10. The nucleic acid molecule of claim 5, wherein sfcid nucleic acid molecule hybridizes 
under stringent conditions to a nucleotide sequence chosen from the group consisting of 
SEQ ID NOSrl , 3, 5, 7, 9, 1 1, 13, 1 5, 17, 1 9, 21, 23, 25, 27, and 29, or a complement of 
said nucleotide sequence. 

11. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule comprises a 
nucleotide sequence selected from the group consisting of: 

(a) a first nucleotide sequence comprising a coding sequence differing by one or 
more nucleotide sequences from a coding sequence encoding said amino acid 
sequence, provided that no more than 20% of the nucleotides in the coding 
sequence in said first nucleotide sequence differ from said coding sequence; 

(b) an isolated second polynucleotide that is a complement of the first polynucleotide; 
and 

(c) a nucleic acid fragment of (a) or (b). 

12. A vector comprising the nucleic acid molecule of claim 1 1 . 

13. The vector of claim 12, further comprising a promoter operably-linlced to said nucleic 
acid molecule. 
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14. A cell comprising the vector of claim 1 2. 

15. An antibody that binds inimunospecifically to the polypeptide of claim 1 . 

1 6. The antibody of claim 1 5, wherein said antibody is a monoclonal antibody. 

1 7. The antibody of claim 1 5, wherein the antibody is a humanized antibody. 

18. A method for determining die presence or amount of the polypeptide of claim 1 in a 
sample, the method comprising: 

(a) providing the sample; 

(b) contacting the sample with an antibody that binds immunospecifically to the 
polypeptide; and 

(c) determining the presence or amount of antibody bound to said polypeptide, 
thereby determining the presence or amount of polypeptide in said sample. 

19. A method for determining the presence or amount of the nucleic acid molecule of claim 5 
in a sample, the method comprising: 

(a) providing the sample; 

(b) contacting the sample with a probe that binds to said nucleic acid molecule; and 

(c) determining the presence or amount of the probe bound to said nucleic acid 
molecule, 

{hereby determining the presence or amount of the nucleic acid molecule in said sample. 

20. The method of claim 19 wherein presence or amount of the nucleic acid molecule is used 
as a marker for cell or tissue type. 

21 . The method of claim 20 wherein the cell or tissue type is cancerous. 

22. A method of identifying an agent that binds to a polypeptide of claim i , the method 
comprising: 

(a) contacting said polypeptide with said agent; and 

(b) determining whether said agent binds to said polypeptide. 
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23. The method of claim 22 wherein the agent is a cellular receptor or a downstream effector. 

24. A method for identifying an agent that modulates the expression or activity of the 
polypeptide of claim 1, the method comprising: 

(a) providing a cell expressing said polypeptide: 

(b) contacting the cell with said agent, and 

(c) determining whether the agent modulates expression or activity of said 
polypeptide, 

whereby an alteration in expression or activity of said peptide indicates said agent modulates 
expression or activity of said polypeptide. 

25 . A method for modulating the activity of the polypeptide of claim 1 , the method 
comprising contacting a cell sample expressing the polypeptide of said claim with a 
compound that binds to said polypeptide in an amount sufficient to modulate the activity 
of the polypeptide. 

26. A method of treating or preventing a MOLX-associated disorder, said method comprising 
administering to a subject in which such treatment or prevention is desired the 
polypeptide of claim 1 in an amount sufficient to treat or prevent said MOLX-associated 
disorder in said subject 

27. The method of claim 26 wherein the disorder is selected from the group consisting of 
cardiomyopathy and atherosclerosis. 

28. The method of claim 26 wherein the disorder is related to cell signal processing and 
metabolic pathway modulation. 

29. The method of claim 26, wherein said subject is a human. 

30. A method of treating or preventing a MOLX-associated disorder, said method comprising 
administering to a subject in which such treatment or prevention is desired the nucleic 
acid of claim 5 in an amount sufficient to treat or prevent said MOLX-associated disorder 
in said subject. 
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3 1 . The method of claim 3 0 wherein the disorder is selected from the group consisting of 
cardiomyopathy and atherosclerosis. 

32. The method of claim 30 wherein the disorder is related to cell signal processing and 
metabolic pathway modulation. 

33. The method of claim 30, wherein said subject is a human. 

34. A method of treating or preventing a MOLX-associated disorder, said method comprising 
administering to a subject in which such treatment or prevention is desired die antibody 
of claim 15 m an amount sufficient to treat or prevent said MOLX-associated disorder tn 
said subject 

35. The method of claim 34 wherein the disorder is diabetes. 

36. The method of claim 34 wherein the disorder is related to cell signal processing and 
metabolic pathway modulation. 

37. The method of claim 34, wherein me subject is a human, 

38. A pharmaceutical composition comprising the polypeptide of claim 1 and a 
phaimacciiticaHy-acccptablc carrier. 

39. A pharmaceutical composition comprising the nucleic acid molec ule of claim 5 and a 
phaxmaceutically-acceptable carrier. 

40. A pharmaceutical composition comprising the antibody of claim 1 5 and a 
pharmaceutically-acceptable carrier, 

41 . A kit comprising in one or more containers, the pharmaceutical composition of claim 3 8. 

42. A kit comprising in one or more containers, the pharmaceutical composition of claim 39. 
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43. A kit comprising in one or more containers, the pharmaceutical composition of claim 40. 

44. A method for determining the presence of or predisposition to a disease associated with 
altered levels of the polypeptide of claim 1 in a first mammalian subject, the method 
comprising: 

(a) measuring the level of expression of the polypeptide in a sample from the first 
mammalian subject; and 

(b) comparing the amount of said polypeptide in the sample of step (a) to the amount 
of the polypeptide present in a control sample from a second mammalian subjeci 
known not to have, or not to be predisposed to, said disease; 

wherein an alteration in the expression level of the polypeptide m the first subject as compared to 
the control sample indicates the presence of or predisposition to said disease. 

45. The method of claim 44 wherein the predisposition is to cancers. 

46. A method for determining the presence of or predisposition to a disease associated with 
altered levels of Ihe nucleic acid molecule of claim 5 in a first mammalian subject, the 
method comprising: 

(a) measuring the amount of the nucleic acid in a sample from the first mammalian 
subject; and 

(b) comparing the amount of said nucleic acid in the sample of step (a) to the amount 
of the nucleic acid present in a control sample from a second mammalian subject 
known not to have or not be predisposed to, the disease; 

wherein an alteration m the level of the nucleic acid in the first subject as compared to the 
control sample indicates the presence of or predisposition io the disease. 

47. The method of claim 46 wherein the predisposition is to a cancer. 

48. A method of treating a pathological state in a mammal, the method comprising 
administering to Ihe mammal a polypeptide in an amount that is sufficient to alleviate the 
pathological state, wherein the polypeptide is a polypeptide having au amino acid 
sequence at least 95% identical to a polypeptide comprising an amino acid sequence of at 
least one ofSEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, or a 
biologically active fragment thereof. 

223 



s 



WO 01/81578 



PCT/US01/13578 



49. A method of treating a pathological state in a mammal, the method comprising 
administering to the mammal the antibody of claim 15 in an amount sufficient to alleviate 
the pathological state. 

50. A method for the screening of a candidate substance interacting with an olfactory 
receptor polypeptide selected from the group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 
12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, or fragments or variants thereof, comprises the 
following steps: 

a) providing a polypeptide selected from the group consisting of the 
sequences of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 
30, or a peptide fragment or a variant thereof, 

b) obtaining a candidate substance; 

c) bringing into contact said polypeptide with said candidate substance; and 

d) detecting the complexes formed between said polypeptide and said 
candidate substance. 

51. A method for the screening of ligand molecules interacting with an olfactoiy receptor 
polypeptide selected from the group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 28, and 30, wherein said method comprises: 

a) providing a recombinant eukaryotic host cell containing a nucleic acid 
encoding a polypeptide selected from the group consisting of the polypeptides 
comprising the amino acid sequences SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 
20, 22, 24, 26, 28, and 30; * 

b) preparing membrane extracts of said recombinant eukaryotic host cell; 

c) bringing into contact the membrane extracts prepared at step b) with a 
selected ligand molecule; and 

d) detecting the production level of second messengers metabolites. 

52. A method for the screening of ligand molecules interacting with an olfactory receptor 
polypeptide selected from the group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 28, and 30, wherein said method comprises: 
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a) pioviding an adenovirus containing a nucleic acid encoding a polypeptide 
selected from the group consisting of polypeptides comprising the amino acid 
sequences SliQ IDNOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 2S, and 30; 

b) infecting an olfactory epithelium with said adenovirus; 

c) bringing into contact the olfactory epithelium t>) with a selected ligand 
molecule; and 

d) detecting the increase of the response to said ligand molecule. 
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As invention 1, but relating to the M0L2-specific SEQIDs 3 
+ 4. 
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As invention 1, but relating to the M0L3-specific SEQIDs 5 
+ 6. 
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As Invention 1, but relating to the M0L5-spec1f1c SEQIDs 9 + 
10. 
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As Invention 1, but relating to the H0L6-spec1f1c SEQIDs 
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